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ALLERGENIC PROTEINS AND PEPTIDES FROM POG DANDER 
AND USES THEREFOR 

5 pflrUfrrnund nf the Invention^ , > / 

Approximately 1 0% of the population become hypersensitized (allergic) upon 
exposure to antigens from a variety of environmental sources. Tliose antigens that induce 
immediate and/or delayed types of hypersensitivity are known as allergens (King, T. P., 
(1976) Adv. Immunol., 22: 77 - 105. These include products of grasses, trees, weeds, animal 

1 0 dander, insects, food, drugs, and chemicals. Genetic predisposition of an individual is 

believed to play a role in the development of immediate allergenic responses (Young, R. P. el 
aL, (1990) Clin. Sci. , 22: 19a) such as atopy and anaphylaxis whose symptoms include hay 
fever, asthma, and hives. 

The antibodies involved in atopic allergy belong primarily to the IgE class of 

1 5 inunimoglobulins. IgE binds to basophils, mast cells and dendritic cells via a specific, high- 
affinity receptor FceRI (Kinet, J.P., (1990) Curr, Opin, Immwnol,, 2: 499 - 505). Upon 
combination of an allergen acting as a ligand with its cognate receptor IgE, FceRI bound to 
the IgE may be cross-linked on the cell surface, resulting in physiological manifestations of 
the IgE - allergen interaction. These physiological effects include the release of, among other 

20 substances, histamine, serotonin, heparin, chemotactic factor(s) for eosinophilic leukocytes 
and/or leukotrienes C4, D4, and E4, which cause prolonged constriction of bronchial smooth 
muscle cells (Hood, L. E. et al., Immunologv (2nd ed.). The Benjamin/Cunmung Publishing 
Co., Inc. (1984)). Hence, the ultimate consequence of the interaction of an allergen with IgE 
are allergic symptoms triggered by release of the aforementioned mediators. Such symptoms 

25 may be systemic or local in nature, depending on the route of entry of the antigen and the 

pattern of deposition of IgE on mast cells or basophils. Local manifestations generally occur 
on epithelial surfaces at the site of entry of the allergen. Systemic effects can induce 
anaphylaxis (anaphylactic shock) which results from IgE-basophil response to circulating 
(intravascular) antigen. 

30 The pet dog {Canis familiaris) is kept in households the world over. In houses and 

public schools where dogs have been kept on a regular basis, dog dander allergens can be 
detected in dust samples (Wood, R. A.'£t aL, (1988) Am Rev Respir. Pis., 122: 358 - 363, 
and Dybendal, T. fiUL, (1989) AilfiffiX^ M: 401 - 41 1). The prevalence of allergy to dogs as 
assessed by skin prick test is approximately 15% (Haahtela, T. £LaL, (1981) AlkcgX, 26: 251 

35 - 256, and de Groot, H. eLaU (1991) L AUgrgy Clin. Immw^QlM £2: 1056-1065). In one 
study, sensitivity to dog allergen(s) was detected in 40% of asthmatic children, even though 
dogs were not kept as pets in their homes (Vanto, T. and Koivikko, A., (1983) APta.P^c;diatic 
S£aQd*,22: 571 -575). 
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Treatment of patients with dog allergy by administration of dog dander extracts has 
not proven to be as efficacious as treatment of cat allergic patients with cat dander extracts 
(Hedlin, G. £LaL,(1991) l Ml^rpv Clin Immunol.. fi2: 955 - 964). As with any 
desensitization scheme involving injection of increasing doses of allergen(s), there are the 
5 drawbacks of potential anaphylaxis during treatment, and the possible necessity of continuing 
therapy over a period of several years to build up sufficient tolerance that results in 
significant dinunution of clinical symptoms. 

Dog hair and dander extracts are complex mixtures containing a number of allergenic 
proteins. (Loewenstein, H sLaL, (1982) PrQCgcdings 1 1th Intgmational Congress of 
1 0 riinical Im munology, London, pp 545-548; Uchlin, T fiUL, (1 984) 

AlkEgX. 22: 125 - 133; Yman, L. £LaL, (1984) Int. Argh. Alkrpy Appl. TmTDtiTlol.U. 44: 358 - 
368; Spitzauer, S. £LaL, (1993) Tnt Arrh Allergy Immunol.. 100: 60 - 67), Two allergens 
present in dog hair/dander have been purified using immunoafSnity chromatography. A 
major allergen from dog, Can f 1 (Nomenclature according to the criteria of the lUIS (Marsh, 
15 D. G! et al. . (1988) riin. Allergy, li: 201 - 209; Agl3 according to original nomenclature), 
has been partially purified by two groups (Schou, C. eUL, (1991) Clin, and FrXP, AllgrgY> 21: 
321 -328 and de Groot sLaL, supra). Both groups, partially purified Caai I was established 
as an allergen by CRIE analysis (Ford A. W. sLsL, (1989) Clin, Exp, AllcrgV, IS: 183 - 190). 
and then rabbits or BalbA> mice were immunized to obtain polyclonal or monoclonal 
20 antibodies against the allergen, respectively. Inununoaflfinity purified CanJI (-25 kD in 
molecular weight, with a minor component -1 8 kD) which elicited a high frequency of 
positive skin prick tests among dog allergic patients was able to deplete 50 - 70% of IgE 
binding to dog dander extracts in RAST (radioallergosorbent test) analysis. While de Groot 
et al. did not attempt to detcnnine any amino acid sequence of CanJ I» Schou sLaL foimd the 
25 amino terminus of their inmiunoaffinity purified Canf I was blocked. Hence, no amino acid 
sequence ofQmSl is presently in the public domain. 

The presence of a second (minor) allergen in dog extract was detected by binding of 
IgE antibodies to dog dander/hair extracts by several groups (de Groot sLaL, supia, Schou, C. 
et al. . supra and Spitzauer et al. . (1993) Int, Argh, AlTfifPY Tmnmnol., IM: 60 - 67). The 
30 molecular weight of a minor allergen was reported to be 1 8 kD (Schou siffim), 1 9 kD 
(Spitzauer etal» supm) and 27 kd (de Groot et el. , su pra) . It is difficult however to correlate 
these results since only one group (de Groot et al. > su pra ) affinity purified an allergen 
designated Canf II (originally named Dog 2 allergen). Qm f H was purified from dog dander 
extracts in a manner analogous to Canf I using monoclonal antibodies generated to a second 
35 allergen present in extracts (de Groot et al» supraV Molecular weight of Cmf II reported by 
this group as -27 kD was later verified to be - 24 kD (Aalberse, R.C. personal 
communication). Purified Cmf II allergen was found to react with IgE of only 66% of dog 
allergic patients. In RAST analysis, Caul II allergen was able to compete with 23% of the 
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IgE directed against dog dander extract. The amino acid sequence of Can f II has not been 
^ preyiqusly'detenhined.^^'--^P-^=i /n.-^ ouun-t -.^ih-^./^.j-r -:rbi?^f>^ -...^..q,^., -v.^ 

Many patients with sensitivity to dog dander allergens are treated currently by 
administration of small, gradually increasing doses of dog dander extracts. Use of these 
5 extracts has multiple drawbacks, including potential anaphylaxis during treatment and the 
necessity of continuing therapy, often for a period of several years to build up sufficient 
tolerance and significant diminution of clinical symptoms. The ability to substitute 
compositions of at least the major dog dander allergens, such as Can f I and Cahf IL would 
overcome several of these drawbacks. Thus, a source of pure allergen that could be provided 
10 in qtiantity for use as a diagnostic or therapeutic reagent and therapeutic methods that would 
overcome the drawbacks associated with dog dander extracts are highly desirable. 

Smnmary pf thg InvCTtion 

This invention provides isolated nucleic acids encoding peptides having at least one 

1 5 biological activity of £aaf I or Caof II, protein allergens of the species Canis familiar is. 
Preferred nucleic acids are cDNAs having a nucleotide sequence shown in Figure 5 (SEQ ID 
NO: I VCanf n and Figure 18 (SEQ ID NO: 67) (CanJf II). The invention also pertains to 
peptides encoded by ail or a portion of such cDNAs (SEQ ID NO:l and SEQ ID NO: 67) and 
having at least one biological activity of £a£Lf I or Caof H. Also contemplated are isolated 

20 nucleic acids which hybridize imder high stringency conditions (e.g., equivalent to 20-27'*C 
below Tm and IM NaCl ) to a nucleic acid having a nucleotide; sequence shown in Figure 5 
(SEQ ID NO: 1) or Figure 1 8 (SEQ ID NO: 67) or which encodes a peptide comprising all or 
a portion of an amino acid sequence of Figure 5 (SEQ ID NO: 2)(£aiLf I) or Figure 1 8 (SEQ 
ID NO: 68)(£ffliXII)- Nucleic acids which encode peptides having an activity of CaiLf I or 

25 Can f II and having at least 50% homology with a sequence shown in Figure 5 (SEQ ID NO: 
2)(CaiLJf I) or Figure 18 (SEQ ID NO: 68)(£aiLf II) are also featured: Peptides having a Can 
f I or Can f II activity produced by recombinant expression of a nucleic acid of the invention, 
and peptides having a £aiif I or Caafll activity prepared by chemical synthesis are, also 
featured by this invention. Preferred peptides have the ability to induce a T cell response, 

30 which may include T cell stimulation (measured by, for example, T cell proliferation or 

cytokine secretion) or T cell nonresponsiveness (i.e., contact with the peptide or a complex of 
the peptide with an MHC molecule of an antigen presenting cell induces the T cell to become 
unxesponsive to stimulatory signals or incapable of proliferation). Other preferred peptides, 
either apart from or in addition to the ability to induce a T cell response, have the ability to 

35 bind the dog dander specific IgE of dog dander-allergic subjects. Such peptides are useful in 
diagnosing sensitivity to dog dander in a subject. Still other peptides, either apart from or in 
addition to the ability to induce a T cell response, have a significantly reduced or negligible 
ability to bind dog dander-allergic IgE. Such peptides are particularly useful as therapeutic 
agents. 



wo 94/160£8 



FCT/US93/12468 



Other preferred peptides comprise an amino acid sequence shown in Figure 5 (SEQ 
ID NO: 2) (CaiLf I) or Figure 18 (SEQ ID NO: 68)(CailJfn)- In one embodiment, peptides 
having a CaaJl or CaaLf H activity and comprising a portion of the amino acid sequence of 
Figure 5 (SEQ ID NO: 2) or Figure 18 (SEQ ID NO: 68) are at least about 8-30 amino acids 
5 in length, preferably about 10-20 amino acids in length, and most preferably about 10-1 6 

amino acids in length. 

Another aspect of the invention features antibodies specifically reactive with a 
peptide having a CaiLf I or Caof H activity. A peptide having an activity of Caaf I or Cml 
II can be used in compositions suitable for pharmaceutical administration. For example, such 
1 0 compositions can be used in a manner similar to dog dander extracts to treat or prevent 
allergic reactions to dog dander in a subject Nucleic acids of tiie invention and peptides 
having an activity of CaaJI or Caol H can be also used for diagnosing sensitivity in a 
subject to a dog dander. 

15 ftr'''*^T>g''g"n filtn ff*"***" Drawings 

Figure 1 shows degenerate primer pairs based on residues 9-15 and 30-37 of the 

mature CaiLf I pr»tein used in the MOPAC technique of PGR amplification. Two internal 

degenerate oligonucleotide probes based on Canil protein residues 17-24 (Dog Probe 1) and 

88-94 (Dog Probe 2) are shown. 
20 Figure 2 shows oligonucleotide primers used in a RACE PGR protocol to obtain the 

3' portion of the Caoil cDNA. A degenerate oligonucleotide probe (Dog Probe 4) is also 

shown. 

Figure 3 shows primers used in an anchored PGR technique to determine the 5' end of 
the Canf l cDNA. A degenerate oligonucleotide probe. Dog Probe 0, based on residues 9-17 

25 of the Canf protein is shown. 

Figure 4 is a schematic representation of a PGR sequencing strategy used to obtain 
tiie sequence of the mature CfflLf 1 protein from botii strands of amplified cDNA. 

Figure 5 is the cDNA sequence and deduced amino acid sequence of Caaf I. 
Figure 6 is a schematic representation of the strategy used to express Caaf I 
30 recombinant protein in bacteria. 

Figure 7 is a schematic representation of tiie strategy used to express £an_f I 
recombinant protein in a mammalian cell using the pJ7L expression vector. 

Figure 8 is a schematic representation of the strategy used to insert a His6 reporter 
group at the carboxy terininus of the recombinant Caail protein to aid purification of the 
35 protein. 

Figure 9 shows tiie aUgnment of tiffce partial 3" £anJf I cDNA sequences (Caaf I, 
2CaiLf I and ^ Gan f I). An (*) indicates tiiat the position in the alignmei»t is perfectly 
conserved and a (.) indicates that tiie position is well conserved. A (-) was inserted where 
necessary for purposes of alignment. 
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Figure 10 is a graphic representation depicting the response of T cell lines from 
patients primed in vUro with recombinant Can f I (rCaif I) and analyzed for response to 
rCan f I and various peptides derived from CanX 1 by positivity index (% of patients who 
positively responded multiplied by the mean stimulation index). 
5 Figure 11 is a graphic representation of a direct binding assay of IgE from a single 

dog allergic patient to bacterially expressed recombinant CaiLf I* 

Figure 12 shows Western blot analysis of four protein preparations (Lane 1 : dog hair 
extract; Lane 2: dog saliva; Lane 3: bacterially expressed recombinant CaiLf I; and Lane 4: 
recombinant Can f I expresised in a manuhalian cell culture system) probed with plasma from 
1 0 a dog allergic patient (#90 1 ) or with plasma from a negative control patient (#250). 

Figure 1 3 shows the design of primers based on a partial amino acid sequence of 
mature Can f II and the sequence strategy for Can f H. ( ) denotes residues which were not 
determined. 

Figure 14 is a schematic representation of the strategy used to clone Can f II cDNA. 
15 Figure 15 is a schematic representation of the strategy used to clone the 5* (A) and 3* 

(B) portions of Can f II cDNA flanking the sequence encoding a portion of the amino acid 
sequence (shaded) of native Caof IL 

Figure 16 is the nucleotide sequence of primers used in cloning ^laaf II cDNA. 
Figure 17 shows the sequence strategy used to determine the nucleotide sequence of 
20 the Can f II cDNA clones la, Ic and Ij. The figure depicts inserts of the cDNA clones la 
(793 bp), Ic (791 bp) and Ij (774 bp). The hatched bars represent coding sequence. The 
triangles indicate the position of an initiator methionine codon (ATG); the codon specifying 
the N-terminal amino acid residue of the mature protein (START); the position of a 
termination codon (STOP); and the position of a polyadenylation signal (As). The arrows 
25 indicate the extent and direction of the sequencing reactions. 

Figure 18 is the cDNA sequence and deduced amino acid sequence (from Clone Ic) 
of £aiLf II . 

Figure 19 is a comparison of the deduced amino acid sequence of Can f II based on 
the cDNA sequence of clone Ic anid a portion of the native Can f II determined by protein 
30 sequencing of the N-terminus. The amino acid residues of a signal peptide are numbered -1 9 
to-1. 

Figure 20 shows northern analysis of mRNA of different dog tissues. Total cellular 
RNA (25 mg) from dog tongue epithelial tissue, parotid salivary gland, skin, mandibular and 
submaxillary glands, liver and spleen was subjected to Northern analysis using QmlU 
35 cDNA as a probe. The position of RNA markers are indicated in kilobases (kb). 

Figure 21 is a comparison of the amino acid sequence of £siLf II with homologus 
proteins MUP 6 Mouse and Rat A2U. The alignment was made with Gene Works program. 
The signal sequences are underlined. Anuno acid residues which are identical in all three 
proteins are boxed. 
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Figure 22A-C are graphic representations of a direct binding assays of human IgE 
binding to native Can f II and recombinant iilan-f II. 

Figure 23 is the nucleotide consensus sequence among cDNA clones la, Ic and Ij 
encoding partial or fiiU length Canf IL 
5 Figure 24 is a graphic representation depicting the response of T cell lines from 12 

patients primed in vitro with recombinant Call I (rCaiLf I) and analyzed for response to 
rCan f I and various peptides derived from Can f I by stimulation index. A stimulation index 
equal to or greater than two times the background is considered "positive." 

Figure 25 is a graphic representation depicting the response of T cell lines from 12 
10 patients primed in vitro with recombinant CanJ'I (rCaO-f I) and analyzed for response to 

rCan f I and various peptides derived from Can f I by mean stimulation index for the group of 
patients with positive responses to the peptides. 

netfliled Description of the Invention 

15 This invention pertains to isolated nucleic acids encoding peptides having at least one 

biological activity of Can f I or £aiLf II, allergens of the species Canis familiaris. 
Preferably, the nucleic acid is a cDNA comprising a nucleotide sequence shown in Figure 5 
(SEQ ID NO: 1 ) (Caof I) or Figure 1 8 (SEQ ID NO:67) (CaiLf II). 

The cDNA shown in Figure 5 (SEQ ID NO:l) encodes a CaiLf I peptide which 

20 includes a 26 amino acid leader sequence encoded by base 1 through base 78. This leader 
sequence is not found in the mature CaiLf I protein, which is encoded by bases 79 through 
525. The deduced amino acid sequence of £aiLf I based on this cDNA is also shown in 
Figure 5 (SEQ ID NO: 2). The cDNA encodes a mature peptide having a predicted 
molecular weight of 19.2 kDa, with a pl of 5.53 and a single potential N-linked glycosylation 

25 site. A culture of £. coli transfected with an expression vector containing the cDNA 

encoding Can f I was deposited under the Budapest Treaty with the American Type Culture 
Collection on December 22, 1992 and assigned accession number 69167. 

The cDN A shown in Figure 1 8 (SEQ ID NO: 67) encodes a Can f II peptide which 
includes a 19 amino acid leader sequence encoded by base 195 through base 251. This leader 

30 sequence is not found in the mature £aaf H protein, which is encoded by bases 252 through 
734. The deduced amino acid sequence of Can f II based on this cDNA is shown in Figure 
1 8 (SEQ ID NO: 68). The cDNA encodes a Caaf II peptide having a predicted molecular 
weight of 1 8.229 kDa, with a pi of 4.54 for a mature recombinant Can f II protein and a pl of 
4.44 for a full length (including singal sequence), recombinant CaiLfll protein, and a single 

35 potential N-linked glycosylation site. N-linked glycosylation may increase the molecular 
weight of the peptide and may alter the pl of the mature protein, A culture of £. coli 
transfected with an expression vector containing the cDNA encoding Cai^f II was deposited 
imder the Budapest Treaty with the American Type.Culture Collection on December 29, 
1993 and assigned accession number XXXX. 



wo 94/16058 



PCT/US93/U468 



c ! > w T Accordingly /one aspect of this myention. pertains ttoisolatodj^ acids comprising 
nucleotide sequences encoding Qml I or CmS II» fragments thereof encoding peptides 
having at least one biological activity of £aaJI or CaolU, and/or equivalents of such 
nucleic acids. The term nucleic acid as used herein is intended to include such fragments and 
S equivalents. The term equivalent is intended to include nucleotide sequences encoding 

functionally equivalent Caaf I or £aiLf II proteins or functionally equivalent peptides having 
an activity of Can f I or Caof II. As defined herein, a peptide having an activity of Canjf I or 
Can f II has at least one biological activity of the Caafl or CaO-fH allergen. Equivalent 
nucleotide sequences will include sequences that differ by one or more nucleotide 

1 0 substitutions^ additions or deletions, such as allelic variants, and will also include sequences 
that differ from the nucleotide sequence encoding Can f l or CaiLf H shown in Figure 5 (SEQ 
ID NO: 1) or Figure 18 (SEQ ID NO: 67) due to the degeneracy of the genetic code. 
Equivalents will also include nucleotide sequences that hybridize under stringent conditions 
(i.e., equivalent to about 20-27''C below melting temperature (Jm) and about IM salt) to the 

15 nucleotide sequence of Caafl shown in Figure 5 (SEQ ID NO: 1) or Caai'II shown in 
Figure 18 (SEQ ID NO: 67). 

Peptides referred to herein as having an activity of Can f I or Can f II or having a Can 
f I or Can f II activity are defined herein as peptides that have an amino acid sequence 
corresponding to all or a portion of the amino acid sequence of Can f I or Can f II shown in 

20 Fiugre 5 (SEQ ID NO: 2) or Figure 1 8 (SEQ ID NO: 68) which peptide has at least one 

biological activity of £aall or £aZLf II- For example, a peptide having an activity of Can f 1 
or Can f II may have the ability to induce a response in Canjf I or £aaf II restricted T cells 
such as stimulation (e.g., T cell proliferation or cytokine secretion) or to induce T cell non- 
responsiveness. Alternatively, or additionally, a peptide having an activity of Can f I or Can 

25 f II may have the ability to bind (to be recognized by) immimoglobulin E (IgE) antibodies of 
dog dander-allergic subjects. Peptides which bind IgE are useful in methods of detecting 
allergic sensitivity to Qmll or CanJ'II in a subject. Peptides that do not bind IgE, or bind 
IgE to a lesser extent than a purified, native Can f I or Can f II protein binds IgE are 
particularly useful as therapeutic agents. 

30 In one embodiment, the nucleic acid is a cDNA encoding a peptide having an activity 

of CaaXI or Can-f II- Preferably, the nucleic acid is a cDNA molecule comprising at least a 
portion of the nucleotide sequence encoding Can f I or Caof II, shown in Figure 5 (SEQ ID 
NO:l) and Figure 18 (SEQ ID NO: 67). A preferred portion of the cDNA molecules of 
Figure 5 and Figure 1 8 includes the coding region of the molecule. 

35 In another embodiment, the nucleic acid of the invention encodes a peptide having an 

activity of QsdJI or CsnSU and comprising an amino acid sequence shown in Figure 5 
(SEQ ID N0:2) (Cmil I) or Figure 18 (SEQ ID NO: 68) (£aiLf II), Preferred nucleic acids 
encode a peptide having aQsnSl or Caojf II activity and having at least about 50% 
homology, more preferably at least about 60% homology and most preferably at least about 
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70% homology with the sequence shown in Figure 5 (SEQ ID NO: 1) (CaiLf I) or Figure 1 8 
(SEQ ID NO: 67) (Can-f ID- Nucleic acids which encode peptides having a Qmf I or Qm f 
II activity and having at least about 90%, more preferably at least about 95%, and most 
preferably at least about 98-99% homology with a sequence set forth in Figure 5 (SEQ ID 
5 NO: 2) (Can f l) or Figure 18 (SEQ ID NO: 68) (Caof II) are also within the scope of the 
invention. Homology refers to sequence similarity between two peptides having an activity 
of Canf I or Canjf H or between two nucleic acid molecules. Homology can be determined 
by comparing a position in each sequence which may be aligned for purposes of comparison. 
When a position in the compared sequence is occupied by the same base or amino acid, then 

10 the molecules are homologous at that position. A degree of homology between sequences is 
a function of the number of matching or homologous positions shared by the sequences. 

Another aspect of the invention provides a nucleic acid which hybridizes imder high 
or low stringency conditions to a nucleic acid which encodes a peptide having all or a portion 
of an amino acid sequence shown in Figure 5 (SEQ ID NO: 2) (Canf I) or Figure 18 (SEQ 

1 5 ID NO: 68) (C^Lf II). Appropriate stringency conditions which promote DNA 

hybridization, for example, 6.0 x sodium chloride/sodium citrate (SSC) at about 45°C, 
followed by a wash of 2,0 x SSC at 50** are known to those skilled in the art or can be found 
in riirreTit Protomlfs in Mole cular Riology. John Wiley & Sons, N.Y. (1989), 6,3.1-6.3.6. 
For example, the salt concentration in the wash step can be selected from a low stringency of 

20 about 2.0 x SSC at 50**C to a high stringency of about 0.2 x SSC at SO^'C. In addition, the 
temperature in the wash step can be increased from low stringency conditions at room 
temperature, about 22°C, to high stringency conditions at about 65°C. 

Isolated nucleic acids encoding peptides having an activity of Cmf I or Caof II, as 
described herein, and having a sequence which differs from the nucleotide sequences shown 

25 in Figure 5 (SEQ ID NO: 1) and Figure 18 (SEQ ID NO: 67) due to degeneracy in the genetic 
code are also within the scope of the invention. Such nucleic acids encode functionally 
equivalent peptides (i.e., a peptide having an activity of Caof I or Caaf II) but differ in 
sequence from the sequences of Figure 5 and Figure 18 due to degeneracy in the genetic 
code. For example, a number of amino acids are designated by more than one triplet. 

30 Codons that specify the same amino acid, or synonyms (for example, CAU and CAC are 
synonyms for histidine) may result in "silent" mutations which do not affect the amino acid 
sequence of the CaiLf I or Can f 11 protein. However, it is expected that DNA sequence 
polymorphisms that do lead to changes in the amino acid sequence of f I or CftTTt f H will 
exist within the dog dander popidation. One skilled in the art will appreciate that these 

35 variations in one or more nucleotides (up to about 3-4% of the nucleotides) of the nucleic 
acids encoding peptides having an activity of CanJfl or Caaf II may exist among individual 
pet dogs due to natural allelic variation. Any and all such nucleotide variations and resulting 
amino acid polymorphisms are within the scope of this invention. Furthermore, there may be 
one or more isoforms or related, cross-reacting family members of Can f I or £aiLf II. Such 
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isofbrms or family members are^defi^ and amino a^^^ 

sequence to Canf l or Caul H, but encoded by genes at different loci. 

Fragments of the nucleic acid encoding Cailf I or CaOLf H are also within the scope of 
the invention. As used herein, a fragment of the nucleic acid encoding CanJ I or £an-f H 

5 refers to a nucleotide sequence having fewer nucleotides than the nucleotide sequence 

encoding the entire amino acid sequence of Caaf I or CaiLf H protein and which encodes a 
peptide having an activity of Caail or CaiLf II (i.e., a peptide having at least one biological 
activity of the Caof I or CaafU allergen) as defined herem. 

Preferred nucleic acid fragments encode peptides of at least about 1 0 amino acid 

1 0 residues in length, preferably about 1 0-20 amino acid residues in length, and more preferably 
about 12-16 amino acid residues in length. Nucleic acid fragments which encode peptides 
having a pan f I activity of at least about 30 amino acid residues in length, at least about 40 
amino acid residues in length, at least about 60 amino acid residues in length, at least about 
80 amino acid residues in length, at least about 100 amino acid residues in length, and at least 

1 5 about 1 40 residues in length or more are also within the scope of this invention. Nucleic acid 
fragments which encode peptides having a CaoiU activity of at least about 30 amino acid 
residues in length, at least about 40 amino acid residues in length, at least about 60 amino 
acid residues in length, at least about 80 amino acid residues in length, at least about 1 00 
amino acid residues in length, at least about 140 residues in length, and at least about 160 

20 amino acid residues in length or more are also within the scope of this invention. 

Nucleic acid fragments within the scope of the invention include those capable of 
hybridizing imder high or low stringency conditions with nucleic acids from other animal 
species for use in screening protocols to detect Caof I or £ml II or allergens that are cross- 
reactive with Cmf I or Canf IL Generally, the nucleic acid encoding a peptide having an 

25 activity of Canf l or CaofH will be selected from the bases encoding the mature protein, 
however, in some instances it may be desirable to select all or part of a peptide from the 
leader sequence portion of the nucleic acids of the invention. Nucleic acids within the scope 
of the invention may also contain linker sequences, modified restriction endonuclease sites 
and other sequences useful for molecular cloning, expression or purification of recombinant 

30 peptides having an activity of Caoi" lor CaU-f II. 

A nucleic acid encoding a peptide having an activity of £aaf I or £anJIl may be 
obtained from mRNA present in salivary glands or other organs of the pet dog Canis 
familiaris. It should also be possible to obtain nucleic acids encoding CanJ* I or CaoX D 
from Cams familiaris genomic DNA. For example, the gene encoding £aaf I or £a!Lf II can 

.35 be cloned from either a cDN A or a genomic library in accordance with protocols herein 

described. A cDNA encodmg Canfl or CmJU can be obtamed by isolating total mRNA 
from Canis familiaris. Double stranded cDNAs can then be prepared from the total mRNA. 
Subsequently, the cDNAs can be inserted into a suitable plasmid or bacteriophage vector 
using any one of a number of known techniques. Genes encoding Cffl.f I or Qmf U can also 
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-lo- 
be cloned using established polymerase chain reaction techniques in accordance with the 
nucleotide sequence information provided by the invention. The nucleic acids of the 
invention can be DNA or RNA. A preferred nucleic acid is a cDNA encoding Can f I or Can 
f II having the sequence depicted in Figure 5 (SEQ ID N0:1) (£aiLf I ) or Figure 1 S (SEQ ID 

5 NO:67)(CaiLfn). 

This invention also provides expression vectors containing a nucleic acid encoding a 
peptide having an activity of Can f I or CaiLf II, operably linked to at least one regulatory 
sequence. Operably linked is intended to mean that the nucleotide sequence is linked to a 
regulatory sequence in a maimer which allows expression of the nucleotide sequence. 

0 Regulatory sequences are art-recognized and are selected to direct expression of the peptide 
having an activity of Can f I or CaO-f II- Accordingly, the term regulatory sequence includes 
promoters, enhancers and other expression control elements. Such regulatory sequences are 
described in Goeddel; Cgng Expression Twlinplpgy: MgthQds in Enzymology l&l. 
Academic Press, San Diego, CA (1990). It should be understood that the design of the 

5 expression vector may depend on such factors as the choice of the host cell to be transformed 
and/or the type of protein desired to be expressed. In one embodiment, the expression vector 
includes a DNA encoding a peptide having an activity of CanJ* I or Can f 11. Such expression 
vectors can be used to transfect cells to thereby produce proteins or peptides, including fusion 
proteins or peptides encoded by nucleic acids as described herein. 

0 This invention further pertaixis to a host cell transfected to express a peptide having an 

activity of Can f 1 or Can f IL The host cell may be any procaryotic or eucaryotic cell. For 
example, a peptide having an activity of £aaf I or QsnIU may be expressed in bacterial 
cells such as £1 co//, insect cells (baculovinis), yeast, or mammalian cells such as Chinese 
hamster ovary cells (CHO). Other suitable host cells can be found in Goeddel, (1990) supra 

5 or known to those skilled in the art. 

Expression in eucaryotic cells such as mammalian, yeast, or insect cells can lead to 
partial or complete glycosylation and/or formation of relevant inter- or intra-chain disulfide 
bonds of recombinant protein. Examples of vectors for expression in yeast S, cerMsae 
include pYepSecl (Baldari. et al. . (1987) EmhfiJL, fi: 229 - 234), pMFa (Kurjan and 

0 Herskowit2,(1982)£dl,30:933-943),pmY88(SchultZ£Ld*,(1987)ikn£.54: 113 - 123), 
and pYES2 (Invitrogen Corporation, San Diego, CA). Baculovinis vectors available for 
expression of proteins in cultured insect cells (SF 9 cells) include the pAc series (Smith et al. . 
(1983) Mol. Cell Biol. . 1: 2156-2165) and the pVL series (Lucklow, V.A., and Summers, 
M.D., (1989) Virologv . 170: 31 - 39). Generally COS cells (Gluzman, Y., (1981) Cdl, 21: 

5 175 - 1 82) are used in conjunction with such vectors as pCDM 8 (Aniffo, A. and Seed, B., 
(1987) Proc. Natl. Acad. Sci. USA , M: 8573 - 8577) for transient amplification/expression in 
mammalian cells, while CHO (dhfr- Chinese Hamster Ovary) cells are used with vectors such 
as pMT2PC (Kaufman et al. . (1987) EMBO J. , fi: 187 - 195) for stable 
amplification/expression in mammalian cells. Vector DNA can be introduced into 
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mammalian cells via conventional techniques such as calcium phosphate or calcium chloride 
co-precipitation, DEAE-dextran-nriediated transfection, or electroporation. Suitable methods 
for transforming host cells can be found in Sambrook £L3L> (Molecular Cloning: A 
T^ft ^nratnrv Manual . 2nd Edition, Cold Spring Harbor Laboratory press (1989)), and other 
5 laboratory textbooks. 

Expression in procaryotes is most often carried out in E. coli with either fusion or 
non-fusion inducible expression vectors. Fusion vectors usually add a number of NH2 
terminal amino acids to the expressed target gene. These NH2 terminal amino acids often are 
referred to as a reporter group. Such reporter groups usually serve two purposes: 1) to 

1 0 increase the solubility of the target recombinant protein; and 2) to aid in the purification of 
the target recombinant protein by acting as a ligand in affinity purification. Often, in fusion 
expression vectors, a proteolytic cleavage site is introduced at the junction of the reporter 
group and the target recombinant protein to enable separation of the target recombinant 
protein from the reporter group subsequent to purification of the fusion protein. Such 

1 5 enzymes, and their cognate recognition sequences, include Factor Xa, thrombin and 

enterokinase. Typical fusion expression vectors include pGEX (Amrad Corp., Melbourne, 
Australia), pMAL (New England Biolabs. Beverly, MA) and pRIT5 (Pharmacia, Piscataway, 
NJ) which fuse glutathione S-transferase, maltose E binding protein, or protein A, 
respectively, to the target recombinant protein. 

20 Inducible non-fusion expression vectors include pTrc (Amann et al.. (1 988) £kns, 62: 

301 315) an^ f^'^ (Rtiirfiftr f>t a1., Gene Expression Technologv: Methods in 
Rn^mology . 185 . Academic Press, San Diego, California (1 990) 60-89). While target gene 
expression relies on host RN A polymerase transcription from the hybrid trp-lac fusion 
promoter in pTrc, expression of target genes inserted into pET 1 Id relies on transcription 

25 from the T7 gnl 0-lac 0 fusion promoter mediated by coexpressed viral RNA polymerase (T7 
gnl). This viral polymerase is supplied by host strains BL21 (DE3) or HMSl 74(DE3) from a 
resident X prophage harboring a T7 gnl under the transcriptional control of the lacUV 5 
promoter. 

One strategy to maximize recombinant Caaf I or £aaf II expression m £. coli is to 
30 express the protein in a host bacteria with an impaired capacity to proteolytically cleave the 
recombinant protein (Gottesman, S., Qgng ExprCHSion Tgchnology: Mgthods in 
Enzymologv . 185 , Academic Press, San Diego, California (1 990) 1 1 9-128). Another strategy 
would be to alter the nucleic acid encoiding the Canf I or Canf II protein to be inserted into 
an expression vector so that the individual codons for each amino acid would be those 
3 5 preferentially utilized in highly expressed E. coli proteins (Wada et al., (1 992) Nyc, Acjjdg 

Res. . 20: 21 1 1 - 21 1 8). Such alteration of nucleic acids of the invention can be carried out by 
standard DNA synthesis techniques. 

The nucleic acids of the invention can also be chemically synthesized using standard 
techniques. Various methods of chemically synthesizing polydeoxynucleotides are known, 
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including solid-phase synthesis which, like peptide synthesis, has been fully automated in 
commercially available DNA synthesizers (Sfifi e.g., Itakura sLaL, U.S. Patent No. 4,598,049; 
Caruthers et al.. U.S. Patent No. 4,458,066; and Itakura, U.S. Patent Nos. 4,401 ,796 and 
4,373,071, incoiporated by reference herein). 
5 The present invention further pertains to methods of producing peptides that have an 

activity of Can f I or CaniH. For example, a host cell transfected with a nucleic acid vector 
directing expression of a nucleotide sequence encoding a peptide having an activity of Caaf I 
or Cmf n can be cultured under appropriate conditions to allow expression of the peptide to 
occur. The peptide may be secreted and isolated from a mixture of cells and medium 
10 containing the peptide having an activity of Caaf I or CaiLf H. Alternatively, the peptide 
may be retained cytoplasmically and the cells harvested, lysed and tiie protein isolated. A 
cell culture includes host cells, media and other byproducts. Suitable media for cell culture 
are well known in tiie art. The peptide having an activity of Caof I or Cani H can be 
isolated from cell culture medium, host cells, or botii using techniques known in tiie art for 
1 5 purifying proteins including ion-exchange chromatography, gel filtration chromatography, 
ultrafiltration, electrophoresis, and immunoaffmity purification witii antibodies specific for a 
peptide having an activity of £aiLf I or Caaf 11. 

Another aspect of tiie invention pertains to isolated peptides having an activity of Can 
f I or Caaf II. A peptide having an activity of CaiLf 1 or £ani" H has at least one biological 
20 activity of tiie Canf l or Can! H aUergen. For example, a peptide having an activity of CaiLf 
1 or Can f II may have the ability to induce a response in £aaf I or CaofH specific T cells 
such as stimulation (T cell proliferation or cytokine secretion) or to mduce T cell non- 
responsiveness. In one embodiment, a peptide having an activity of CaiLf I or Caof H 
stimulates T cells as evidenced by, for example, T cell proliferation or cytokine secretion. In 
25 anotiier embodiment, peptides having a CaiLf I or Caof U activity induce T cell non- 
responsiveness in which T cells are unresponsive to a subsequent challenge with a CaiLf 1 or 
Can f II peptide following exposure to the peptide. In yet another embodiment, a peptide 
having a Caofl or CaiLf n activity has reduced IgE binding activity compared to purified, 
native Caaf I or Can f II protein. A peptide having an activity ofCsnSl or QmfH may 
30 differ in amino acid sequence from tiie Canil or Caof H sequence depicted in Figure 5 

(SEQ ID NO:2) (CaLf I) or Figure 1 8 (SEQ ID NO: 68) (Caaf H) but such differences result 
in a modified protein which functions in the same or similar manner as a native Cm fl or 
Can f II protein or which has the same or similar characteristics of a native Caaflor Caaf II 
protein. Various modifications of the Caafl or Can f II protein to produce tiiese and otiier 
35 functionally equivalent peptides are described in detail herein. The term peptide, as used 
herein, refers to pq)tides, proteins, and polypeptides. 

A peptide can be produced by modification of the amino acid sequence of the Caaf I 
or Caaf n protein shown in Figure 5 (SEQ ID NO: 2) (Caaf I) or Figure 18 (SEQ ID NO: 
68) (Caaf II). such as a substitution, addition, or deletion of an amino acid residue which is 
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^*?^not'directiy involved in^the.fimctionioftthe p invention can be at least 

about 10 amino acid residues in length, preferably about 10-20 amino acid residues in length, 
and more preferably about 10-16 amino acid residues in length. Peptides having an activity 
of CaaXI or Cao-fll and which are at least about 30 amino acid residues in length, at least 
S about 40 amino acid residues in length, at least about 60 amino acid residues in length, at 
least about 80 amino acid residues in length, and at least about 1 00 amino acid residues in 
length are also included within the scope of this invention. 

Another embodiment of the invention provides a substantially pure preparation of a 
peptide having an activity of CsoLf I or £aaJIL Such a preparation is substantially free of 

1 0 proteins and peptides with which the peptide naturally occurs (i.e., other canine peptides), 
either in a cell or when secreted by a cell. 

The term isolated as used herein refers to a nucleic acid or peptide that is substantially 
free of cellular materia] or culture medium when produced by recombinant DNA techniques, 
or chemical precursors or other chemicals when chemically synthesized. Such proteins or 

1 S peptides are also characterized as being free of all other dog dander proteins. Accordingly, 
an isolated peptide having an activity of Cmifl or Can f II is produced recombinantly or 
synthetically and is substantially free of cellular material and culture medium or substantially 
free of chemical precursors or other chemicals and is substantially free of all other dog 
proteins. An isolated nucleic acid is also free of sequences which naturally flank the nucleic 

20 acid (i.e., sequences located at the 5' and 3* ends of the nucleic acid) in the organism from 
which the nucleic acid is derived. 

Peptides having an activity of Caoi" I or CaiJ II can be obtained, for example, by 
screening peptides recombinantly produced from the corresponding fragment of the nucleic 
acid of £mJ[ I or Cani'Il encoding such peptides. In addition, fragments can be chemically 

25 synthesized using techniques known in the art such as conventional Meuifield solid phase i- 
Moc or t-Boc chemistry. For example, the Can f I or Can f II protein may be arbitrarily 
divided into fragments of desired length with no overlap of the fragments, or preferably 
divided into overlapping fragments of a desired length. The fragments can be produced 
(recombinantly or by chemical synthesis) and tested to identify those peptides having a Can f 

30 I or Can! II activity (i.e., the ability to induce a T cell response such as stimulation 

(proliferation, cytokine secretion), nonresponsiveness, and/or has reduced IgE binding 
activity). 

In one embodiment, peptides having an activity of Can f I or Can f II can be identified 
by the ability of the peptide to stimulate T cells or to induce T cell non-responsiveness. 
35 Peptides which stimulate T cells, as determined by; for example, T cell proliferation or 
cytokine secretion are defined herein as comprisitig at least one T cell epitope. T cell 
epitopes are believed to be involved in initiation and perpetuation of the immune response to 
the protein allergen which is responsible for the clinical symptoms of allergy. These T cell 
epitopes are thought to trigger early events at the level of the T helper cell by binding to an 
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appropriate HLA molecule on the surface of an antigen presenting cell, thereby stimulating 
the T cell subpopulation with the relevant T cell receptor for the epitope. These events lead 
to T cell proliferation, lymphokine secretion, local inflammatory reactions, recruitment of 
additional immune cells to the site of antigen/T cell interaction, and activation of the B cell 

5 cascade, leading to the production of antibodies. One isotype of these antibodies, IgE, is 
fundamentally important to the development of allergic symptoms and its production is 
mfluenced early in the cascade of events at the level of the T helper cell, by the nature of the 
lymphokines secreted A T cell epitope is the basic element, or smallest unit of recognition 
by a T cell receptor, where the epitope comprises amino acids essential to receptor 

1 0 recognition. Amino acid sequences which mimic those of the T cell epitopes and which 
modify the allergic response to protein allergens are within the scope of this invention. 

Screening peptides for those which retain a Canf I or Qmf II activity as described 
herein can be accomplished using one or more of several different assays. For example, /« 
vitro. Can f I or CaiLf n T cell stimulatoiy activity is assayed by contacting a peptide known 

1 5 or suspected of having a CanJI or Can f II activity vsdth an antigen presenting cell which 
presents appropriate MHO molecules in a T cell culture. Presentation of a peptide having a 
Canf I or Canf 11 activity in association with appropriate MHC molecules to T cells in 
conjunction with the necessary costimulation has the effect of transmitting a signal to the T 
cell that induces the production of increased levels of cytokines, particularly of interieukin-2 

20 and interleukin-4. The culture supernatant cian be obtamed and assayed for interleukin-2 or 
other known cytokines. For example, any one of several conventional assays for interieukin- 
2 can be employed, such as the assay described in PrOP, Natl. Acad. Sci USA, Sfi: 1333 
( 1 989) the pertinent portions of which are incorporated herein by reference. A kit for an 
assay for the production of interferon is also available from Genzyme Corporation 

25 (Cambridge, MA). 

Alternatively, a common assay for T cell proliferation entails measuring tritiated 
thymidine incorporation. The proliferation of T cells can be measured in vitro by 
determining the amount of 3H-labeled thymidine incorporated into the replicating DNA of 
cultured cells. Therefore, the rate of DNA synthesis and, in turn, the rate of cell division can 

30 be quantified. 

In one embodiment, peptides which have £anJf I or CaiLf H T cell stimulating activity 
(i.e., the peptide comprises at least one T cell epitope) can be identified using an algorithm 
which predicts the presence of T cell epitopes in a protein sequence, such as the algorithm 
described by Hill et al., innmal of Tmmunologv, 142:189-197 (1991). The algorithm of Hill 
35 et al, predicts the location of T cell epitopes in a protein by the presence of certain patterns 
within the sequence which arc likely to bind MHC and therefore may contain T cell epitopes. 
Based on the Hill et al. algorithm, two 13 amino acid peptides (discussed in Example 10) 
have been identified and produced synthetically. Such peptides were tested for T cell activity 
- as described above (e.g., by measuring cellular uptake of tritiated thymidine). Specifically, 
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in Example 10, human T cell stimulating activity was tested by culturing T cells obtained 
from an individual sensitive to Can f I (i.c., an individual who has an IgE mediated inmiune 
response to Canf D with a peptide derived from CaiLf I and proliferation of T cells in 
response to the peptide was determined, e.g., by measuring cellular uptake of tritiated 
5 thymidine. Stimulation indices for responses by T cells to peptides were calculated as the 
maximum CPM in response to a peptide divided by the control CPM. A stimulation index 
(S.I.) equal to or greater than two times the background level was considered "positive". 
Positive results were used to calculate the mean stimulation index for each peptide for the 
group of patients tested (See Figiu« 25), Preferred peptides of this invention comprise at 

10 least one T cell epitope and have a mean T cell stimulation index of greater than or equal to 
2.0. A peptide having a mean T cell stimulation index of greater than or equal to 2,0 in a 
significant number of dog dander allergen sensitive patients tested is considered useful as a 
therapeutic agent Preferred peptides have a mean T cell stimulation index of at least 2.5, 
more preferably at least 3.0, more preferably at least 3.5, more preferably at least 4.0, more 

1 5 preferably at least 5.0 and most preferably at least about 6. For example, peptides having a 
Canf I activity and having a mean T cell stimulation index of at least 5, as indicated by data 
shown in Figure 25, include Construct 1 (SEQ ID NO:105). Construct 2 (SEQ ID NO:106), 
and Construct 3 (SEQ ID NO: 107). T cell epitopes can also be predicted and determined as 
described above for peptides derived from Canf II. 

20 In addition, preferred peptides have a positivity index (P.I.) of at least about 60, more 

preferably about 1 00, more preferably at least about 200 and most preferably at least about 
300, The positivity index for a peptide is determined by multiplying the mean T cell 
stimulation index by the percent of individuals, in a population of individuals sensitive to dog 
dander allergens (e.g., preferably a population of at least 12 individuals, more preferably a 

25 population of at least 30 individuals or more), who have a T cell stimulation index to such 
peptide of at least 2.0. Thus, the positivity index represents both the strength of a T cell 
response to a peptide (S.I.) and the frequency of a T cell response to a peptide in a population 
of individuals sensitive to dog dander allergens. In Figure 10, the bar represents the 
positivity index and the percent of individuals tested who have a T cell stimulation index of 

30 at least 2.0 to various peptides derived from CaiLf 1. For example, as shown in Figure 25, 
Peptide A0095 has a mean S.L of 3.0 and 43% of positive responses in the group of 
individuals tested resulting in a positivity index of 129. 

In order to determine precise T cell epitopes by, for example, fine mapping 
techniques, a peptide having Qml I or £30111 T cell stimulating activity and thus 

35 comprising at least one T cell epitope as determined by T cell biology techniques is modified 
by addition or deletion of amino acid residues at either the amino or carboxy terminus of the 
peptide and tested to determine a change in T cell reactivity to the modified peptide. 
Following this technique, peptides are selected and produced recombinantly or synthetically. 
Peptides are selected based on various factors, including the strength of the T cell response to 
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the peptide (e.g., stimulation index), the frequency of the T cell response to the peptide in a 
population of individuals sensitive to dog dander allergens, and the potential cross-reactivity 
of the peptide with other dog dander allergens. The physical and chemical properties of these 
selected peptides (e.g., solubility, stability) are exammed to determine whether the peptides 
5 are suitable for use in therapeutic compositions or whether the peptides require modification 
as described herein. The ability of the selected peptides or selected modified peptides to 
stimulate human T cells (e.g., induce proliferation, lymphokine secretion) is then determined 
as described herein. 

In another embodiment, a peptide having a CanJ I or Caaf n activity is screened for 
1 0 the ability to induce T cell non-responsiveness. The ability of a peptide known to stimulate T 
cells (as determined by one or more of the above described assays), to inhibit or completely 
block the activity of purified native Caaf I or Cani H or portion thereof and induce a state of 
non-responsiveness can be detennined using subsequent attempts at stimulation of the T cells 
with antigen presenting cells that present native Caof I or Cattill or peptide having a Qml I 
1 5 or Canf II activity following exposure to the peptide, having a Qml I or £aiLf H activity. If 
the T cells are unresponsive to the subsequent activation attempts, as determined by 
intcrleukin-2 synthesis and/or T cell proliferation, a state of non-responsiveness has been 
induced. Ssfi, e.g., Gimmi sUL. (1993) Proff. Natl. Acad, Sci USA, 2Q: 6586 - 6590; and 
Schwartz (1990) Science . 248 : 1349 - 1356, for assay systems that can be used as the basis 
20 for an assay in accordance with the present invention. 

In yet another embodiment, peptides having a CanJ I or Canf H activity are 
identified by IgE binding activity. For therapeutic purposes, peptides of the invention 
preferably do not bind IgE specific for a dog dander allergen, or bind such IgE to a 
substantially lesser extent than the corresponding purified native dog dander allergen binds 
25 such IgE. Reduced IgE binding activity refers to IgE binding activity that is less than that of 
purified native Canf I or Can f II protein. If a peptide having a QsJlSl or £aaf H activity is 
to be used as a diagnostic reagent, it is not necessary that the peptide have reduced IgE 
binding activity compared to the native CmLf I or Canl II allergen. IgE binding activity of 
peptides can be determined by, for example, an enzyme-linked inwnunosorbent assay 
30 (ELISA) using, for example, sera obtained from a subject, (i.e., an allergic subject) that has 
been previously exposed to the native CaiLf I or Cmf II allergen. Briefly, the peptide 
suspected of having a CaiLf I or Canill activity is coated onto wells of a microtiter plate. 
After washing and blocking the wells, antibody solution consisting of the plasma of an 
allergic subject who has been exposed to a peptide suspected of having a Cani I or Cmif H 
35 activity is incubated in the wells. The plasma is generally depleted of IgG before incubation. 
A labeled secondary antibody is added to the wells and incubated. The amount of IgE 
binding is then quantified and compared to the amount of IgE bound by a purified, native 
Canf I or Canf II protein. Alternatively, the IgE binding activity of a peptide can be 
determined by Western blot analysis. For example, a peptide suspected of having a Caaf I or 
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Can f II activity is run on a polyacrylamide gel using SDS-PAQE., The peptide is then 
transferred to nitrocellulose and subsequently incubated with sera from an allergic subject. 
After incubation with a labeled secondary antibody, the amount of IgE bound is then 
determined and quantified. 
5 Another assay which can be used to detennine the IgE binding activity of a peptide is 

a competition ELISA assay. Briefly^ an IgE antibody pool is generated by combining plasma 
from dog dander allergic subjects that have been shown by direct ELISA to have IgE reactive 
with native Canf I or Can f IL This pool is used in ELISA competition assays to compare 
IgE binding of native Can f I or CaiLf H and a peptide suspected of having a Can f I or Can f 

10 II activity. IgE binding for the native Can f I or Can f II protein and a peptide suspected of 
having a Can f I or CaiLf H activity is detennined and quantified. 

If a peptide having an activity of £aiLf I or £an_f II binds IgE, and is to be used as a 
therapeutic agent, it is preferable that such binding does not result in the release of mediators 
(e.g., histamines) from mast cells or basophils. To detennine whether a peptide which binds 

1 5 IgE results in the release of mediators, a histamine release assay can be performed using 
standard reagents and protocols obtained, for example, from Amac, Inc. (Westbrook, ME). 
Briefly, a buffered solution of a peptide suspected of having a £aiLf I or Canf II activity is 
combined with an equal volume of whole heparinized blood from an allergic subject. After 
mixing and incubation, the cells are pelleted and the supematants are processed and analyzed 

20 iising a radioimmunoassay to determine the amount of histamine released. 

Peptides having an activity of £mLf I or £aiLf H which are to be used as therapeutic 
agents are preferably tested in manmialian models of dog dander atopy, such as the mouse 
model disclosed in Tamura fiLaL, (1986) Microbiol. Immunol. - 2Q: 883 - 896, or in U.S. 
Patent 4,939,239, or in the primate model disclosed in Chiba et al. . (1990) Int. Arch. Allerg y 

25 Immunol.. 22: 83-88. Initial screening for IgE binding to a peptide having an activity of 
Can f I or CaiLf II may be performed by scratch tests or intradermal skin tests on laboratory 
animals or hxmian volunteers, or in in vitro systems such as RAST, RAST inhibition, ELISA 
assay, RIA (radioimmunoassay), or a histamine release assay, as described above. 

It is possible to modify the structure of a peptide having an activity of Can f I or Can f 

30 II for such purposes as increasing solubility, enhancing therapeutic or prophylactic efficacy, 
or stability (e.g., shelf life ex vivo and resistance to proteolytic degradation irt vivo). Such 
modified peptides are considered functional equivalents of peptides having an activity of Can 
f I or Canf II as defined herein. A modified peptide can be produced in which the amino 
acid sequence has been altered, such as by amino acid substitution, deletion, or addition, to 

35 modify inununogenicity and/or reduce allergenicity, or to which a component has been added 
for the same purpose. 

For example, a peptide having an activity of Can f I or £aiLf II can be modified so 
that it maintains the ability to induce T cell non-responsiveness and bind MHC proteins 
without the ability to induce a strong proliferative response or possibly, any proliferative 
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response vAtcn administered in immunogenic form. In this instance, critical binding residues 
for T cell receptor function can be determined using known techniques (e.g., substitution of 
each residue and determination of the presence or absence of T cell reactivity). Those 
residues shown to be essential to interact with the T cell receptor can be modified by 
5 replacing the essential amino acid with another, preferably similar amino acid residue (a 
conservative substitution) whose presence is shown to enhance, diminish but not eliminate, 
or not affect T cell reactivity. In addition, those amino acid residues which are not essential 
for T cell receptor interaction can be modified by being replaced by another amino acid 
whose incorporation may enhance, diminish but not eliminate, or not affect T cell reactivity, 

1 0 but does not eliminate binding to relevant MHC. 

Additionally, a peptide having an activity of CaiLf I or CaoiH can be modified by 
replacing an amino acid shown to be essential to interact with the MHC protein complex with 
another, preferably similar amino acid residue (conservative substitution) whose presence is 
shown to enhance, diminish but not eliminate, or not affect T cell activity. In addition, amino 

1 5 acid residues which are not essential for interaction with the MHC protein complex but which 
still bind the MHC protein complex can be modified by being replaced by another amino acid 
whose incorporation may enhance, not affect, or diminish but not eliminate T cell reactivity. 
Preferred amino acid substitutions for non-essential amino acids include, but are not limited 
to substitutions with alanine, glutamic acid, or a methyl amino acid. 

20 Another example of modification of a peptide having an activity of CaaJ I or Can f II 

is substitution of cysteine residues preferably with alanine, serine, threonine, leucine or 
glutamic acid residues to minimize dimerization via disulfide linkages. In addition, amino 
acid side chains of fragments of the protein of the invention can be chemically modified. 
Another modification is cyclization of the peptide. 

25 In order to enhance stability and/or reactivity, a peptide having an activity of CanJ"! 

or Can f II can be modified to incorporate one or more polymorphisms in the amino acid 
sequence of the protein allergen resulting firom any natural allelic variation. Additionally, D- 
amino acids, non-natural amino acids, or non-amino acid analogs can be substituted or added 
to produce a modified protein within the scope of this invention. Furthermore, a peptide 

30 having an activity of £aaf I or Can f II can be modified using polyethylene glycol (PEG) 
according to the method of A. Sehon and co-workers (Wie et al. , supra) to produce a protein 
conjugated with PEG. In addition, PEG can be added during chemical synthesis of the 
protein. Other modifications of a peptide having an activity of CanJ lor Cm f II include 
reduction/alkylation (Tarr, Methods of P rntein Microcharacterization. J, E. Silver ed,, 

35 Humana Press, Clifton NJ 155-194 (1986)); acylation (Tair. supxs); chemical coupling to an 
appropriate carrier (Mishell and Shiigi, eds. Selected Met hods in Cellular Immunologv. WH 
Freeman, San Francisco, CA (1 980), U.S. Patent 4,939,239; or mild formalin treatment 
; (Mnrch (1Q71) Int. Arch. of Allergv and AnnMmmunQl»41: 199 > 215). 
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' »H ' ; To facilitate purification and potentially^inci^e splubUity of^^ having an 

activity nf Canf 1 or Can f IL it is possible to add an amino acid fusion moiety to the peptide 
backbone. For example, hexa-histidine can be added to the protein for purification by 
immobilized metal ion affinity chromatography (Hochuli, E. fiLaL, (1988) Bio/Technology. 
5 Sl: 1321 - 1325). In addition, to facilitate isolation of peptides free of irrelevant sequences, 
specific ehdoprotease cleavage sites can be introduced between the sequences of the fusion 
moiety and the peptide. In order to successfully desensitize a subject to £aaf I or Can f II 
protein or related allergen, it may be necessary to increase the solubility of the protein by 
adding functional groups to the protein, or by omitting hydrophobic regions of the protein. 

10 To potentially aid proper antigen processing of T cell epitopes within Cmf I or Can f 

II, canonical protease sensitive sites can be engineered between regions, each comprising at 
least one T cell epitope via recombinant or synthetic methods. For example, charged amino 
acid pairs, such as KK or RR, can be introduced between regions within a protein or fragment 
during recombinant construction th^eof The resulting peptide can be rendered sensitive to 

1 5 cleavage by cathepsin and/or other trypsin-like enzymes which would generate portions of 
the protein containing one or more T cell epitopes. In addition, such charged amino acid 
residues can result in an increase in the solubility of the peptide. 

Site-directed mutagenesis of a nucleic acid encoding a peptide having an activity of 
Can f I or Qan f II can be used to modify the structure of the peptide by methods known in 

20 the art. Such methods may, among others, include polymerase chain reaction (FCR) with 
oligonucleotide primers bearing one or more mutations (Ho sLaL ( 1 989) Qsne, ZZ: 5 1 - 59) 
or total synthesis of mutated genes (Hostomsky, Z. £LaL, (1989) Btochem. Biophvs. Res. 
Comm . 161 : 1056 - 1063)- To enhance recombinant protein expression, the aforementioned 
methods can be applied to change the codons present in the cDNA sequence of the invention 

25 to those preferentially utilized by the host cell in which the recombinant protein is being 
expressed (Wada fiLaL» SUpm). 

Another aspect of the invention pertains to an antibody specifically reactive with a 
peptide having an activity of Canf I or CsjLf II. The antibodies of this invention can be used 
to standardize allergen extracts or to isolate the naturally-occurring or native form ofQmSl 

30 or CanJ'II. For example, by using peptides having an activity of CstlII or £aiLf II based on 
the cDNA sequence of CaO-f I or Canf IL anti-protein/anti-peptide antisem or monoclonal 
antibodies can be made using standard methods. A maimnal such as a mouse, a hamster or 
rabbit can be immunized with an immunogenic form of the peptide (e.g., QsnSl or Caiuf II 
protein or an antigenic fragment which is capable of eliciting an antibody response). 

35 Techniques for conferring immunogenicity on a protein or peptide include conjugation to 
carriers or other techniques well known in the art. A peptide having an activity of CanJ^I or 
Can f II can be administered in the presence of adjuvant The progress of immunization can 
be monitored by detection of antibody titers in plasma or serum. Standard ELISA or other 
immunoassay can be used with the immunogen as antigen to assess the levels of antibodies. 
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Following immunization, anti-CanJT 1 r anti-Cailf H antisera can be obtained and. if 
desired, polyclonal anti-£aiLf I or anti-CanJ II antibodies isolated from the serum. To 
produce monoclonal antibodies, antibody producing cells (lymphocytes) can be harvested 
from an immtinized animal and fused by standard somatic cell fiision procedures with 
5 immortalizing ceUs such as myeloma cells to yield hybridoma cells. Such techniques are 
well known in the art, for example the hybridoma technique originally developed by Kohler 
and Milstein, (1975) Matuifi, 25fi: 495 - 497) as well as other techniques such as the human B 
cell hybridoma technique (Kozbar £LflL, (1983) ImmuflologY TodSY. 4: 72) and the EBV- 
hybridoma technique to produce human monoclonal antibodies (Cole sUL, (1985) 
10 M,.nnr1nnal An fi^r^rfips and Cancer Therapv. Alan R. Liss, Inc. pp. 77-96). Hybridoma cells 
can be screened immunochemically for production of antibodies specifically reactive with a 
peptide having an activity of CanJTI or CaoiH and the monoclonal antibodies isolated. 

The term antibody as used herein is intended to include fragments thereof which are 
also specifically reactive with the peptide having an activity of Cailfl or CaiLfH. 
1 5 Antibodies can be fragmented using conventional techniques and the Augments screened for 
utility in the same manner as described above for whole antibodies. For example, F(ab')2 
fragments can be generated by treating antibody with pepsin. The resulting F(ab')2 fragment 
can be treated to reduce disulfide bridges to produce Fab' fragments. The antibody of the 
present invoition is fiirther intended to include bispecific and chimeric molecules having an 
20 anti- Can f I or anti-CaP-f II portion. 

Another aspect of this invention provides T cell clones and soluble T cell receptors 
specifically reactive with a peptide having an activity of Cani"! or Caof II. Monoclonal T 
cell populations (i.e., T cells genetically identical to one another and ejqjressing identical T 
cell receptors) can be derived fix)m an individual sensitive to Caof I or Caai II, followed by 
25 repetitive in vitro stimulation with a fiaaf I or Caoi II protein or peptide having an activity 
of Caaf I nr Can f ll in the presence of MHC-matched antigen-presenting cells. Single CassJ 
I or Can f II MHC responsive cells can then be cloned by limiting dilution and permanent 
lines expanded and maintained by periodic in vitro restimulation. Alternatively, Caaf I or 
Caafll specific T-T hybridomas can be produced by a technique similar to B cell hybridoma 
30 production. For example, a mammal, such as a mouse, is immunized with a peptide havmg 
an activity of Cmf I or Caaf H, T cells are then purified and fused with an autonomously 
growing T cell tumor line. From the resulting hybridomas, cells responding to a peptide 
having an activity of Can f I or £aiLf II are selected and cloned. Procedures from 
propagating monoclonal T cell populations are described in Cellvtlar and Molgcular 
35 Trnmiinoloyv (Abul K. Abbas fiLaL ed.), W.B. Saunders Company, Philadelphia, PA (1991) 
page 139. Soluble T cell receptors specifically reactive with a peptide having an activity of 
Cani"! or Can f II can be obtained by immunoprecipitation using an antibody against the T 
cell receptor as described in Immunology : A Synthesis (Second Edition), Edward S. Golub 
. . et al. . ed., Sinauer Associates, Inc., Suiiderland, MA (1991) pages 366-269. 
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T cell clones specifically reactive with a peptide having an activity of CanJf I pr Can f 
U can be used to isolate and molecularly clone the gene encoding the relevant T cell receptor. 
In addition, a soluble T cell receptor specifically reactive with a peptide having an activity of 
Can f 1 or Can f II can be used to interfere with or inhibit antigen-dependent activation of the 
5 relevant T cell subpopulation, for example, by administration to an individual sensitive to 
Caaf I or Can f II. Antibodies specifically reactive with such a T cell receptor can be 
produced according to the techniques described herein. Such antibodies can be used to block 
or interfere with the T cell interaction with peptides presented by MHO. 

Exposure of allergic subjects to peptides having an activity of CaaJI or £aiLf II and 

10 which have T cell stimulating activity, may cause the appropriate T cell subpopulations to 
become non-responsive to the respective protein allergen (e.g., fail to stimulate an inmiune 
response upon such exposure). In addition, such administration may modify the lymphokine 
secretion profile as compared with exposure to the naturally-occurring protein allergen or 
portion thereof (e.g., result in a decrease of IL-4 and/or an increase in IL-2). Furthermore, 

1 5 exposure to peptides having an activity of Caaf I or Canf II which have T cell stimulating 
activity may influence T cell subpopulations which normally participate in the response to 
the allergen such that these T cells are drawn away from the site(s) of normal exposure to the 
allergen (e.g., nasal mucosa, skin, and lung) towards the site(s) of therapeutic administration 
of the protein or fragment derived therefrom. This redistribution of T cell subpopulations 

20 may ameliorate or reduce the ability of an individual's inunune system to stimulate the usual 
immune response at the site of normal exposure to the allergen, resulting in a diminution in 
allergic symptoms. 

A peptide having an activity of Canf 1 or Can f II when administered to a subject 
sensitive to dog dander allergens is capable of modifying the B cell response, T cell response, 

25 or both the B cell and the T cell response of the subject to the allergen. As used herein, 

modification of the allergic response of a subject to a dog dander allergen can be defined as 
non-responsiveness or diminution in symptoms to the allergen, as determined by standard 
clinical procedures (S^ e.g., Vamey fiUL, (1990) British Medical Journal. 2fl2: 265 - 269), 
including diminution in dog dander induced asthmatic symptoms. As referred to herein, a 

30 diminution in symptoms includes any reduction in the allergic response of a subject to the 
allergen following a treatment regimen with a peptide of the invention. This diminution in 
symptoms may be determined subjectively (e.g., the patient feels more comfortable upon 
exposure to the allergen), or clinically, such as with a standard skin test 

Peptides or antibodies of the present invention can also be used for detecting and 

35 diagnosing sensitivity to Cmf I or £arLf H. For example, this could be done by combining 
blood or blood products obtained from a subject to be assessed for sensitivity with peptide 
having an activity of Can f I or CanJ" H, under conditions appropriate for binding of 
components in the blood (e.g., antibodies, T cells, B cells) with the peptide(s) and 
determining the extent to which such binding occurs. Other diagnostic methods for allergic 
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diseases which the peptides or antibodies of the present invention can be used include radio- 
allergosorbent test (RAST), paper radioimmunosorbent test (PRIST), enzyme linked 
immunosorbent assay (ELIS A), radioimmunoassays (RIA), immuno-radiometric assays 
(IRMA), luminescence immunoassays (LIA), histamine release assays and IgE immimoblots. 
5 The present invention further provides methods of detecting and treating sensitivity in 

a subject toCmf 1 orCaiLf H. The presence in subjects of IgE specific forCaiLfl or Can f 
II and the ability of T cells of the subjects to respond to T cell epitopes of CanJ I or Cmtf II 
can be determined by administering to the subject an Immediate Type Hypersensitivity test 
and/or a Delayed Type Hypersensitivity test (SfiC e.g., Tmmnnolfigy (1985) Roitt, LM., 
1 0 Brostoff, J., Male, D.K. (eds), C.V. Mosby Co., Gower Medical Publishing, London, NY, pp. 
19.2-19.18; pp.22. 1-22. 10) utilizing a peptide having an activity of Caaf I or Caul II, or a 
modified form of a peptide having an activity of Caaf I or Cani* II, each of which binds IgE 
specific for the allergen. The same subjects are administered a Delayed Type 
Hypersensitivity test prior to, simultaneously with, or subsequent to administration of the 
1 5 Immediate Type Hypersensitivity test. Of course, if the Immediate Type Hypersensitivity 
test is administered prior to the Delayed Type Hypersensitivity test, the Delayed Type 
Hypersensitivity test would be given to those subjects exhibiting a specific Immediate Type 
Hypersensitivity reaction. The Delayed Type Hypersensitivity test utilizes a peptide having 
an activity nf Can f I or Canf H which has hiunan T cell stimulating activity and which does 
20 not bind IgE specific for the allergen in a substantial percentage of the population of subjects 
sensitive to the allergen (e.g., at least about 75%). Those subjects found to have both a 
specific Immediate type Hypersensitivity reaction and a specific Delayed Type 
Hypersensitivity reaction are administered an amount of a composition suitable for 
pharmaceutical administration. The composition comprises the peptide having an activity of 
25 Can f I or Canf II as used in the Delayed Type Hypersensitivity test and a pharmaceutical! y 
acceptable carrier or diluent. 

A peptide having an activity of Can f I or CaaXU can be used in methods of 
diagnosing, treating, and preventing allergic reactions to a dog dander allergen or a cross- 
reactive protein allergen. Thus, the present invention provides compositions suitable for 
30 pharmaceutical administration comprising an amount of at least one peptide having an 

activity of Canf I or Can f II and a pharmaceutically acceptable carrier. Administration of 
the compositions of the present invention to a subject to be desensitized can be carried out 
using known pirocedures, at dosages and for periods of time effective to reduce sensitivity 
(i.e., reduce the allergic response) of the subject to a dog dander allergen. The term subject is 
35 intended to include living organisms in which an immune response can be elicited, e.g., 
niammals. Examples of subjects include humans, dogs, cats, mice, rats, and transgenic 
species thereof An amoxmt of at least one peptide having an activity of £aaf I or Can f II 
necessary to achieve a therapeutic effect may vary according to factors such as the degree of 
sensitivity of the subject to dog dander, the age, sex, and weight of the subject, and the ability 
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of a peptide having ^ oTCaSjTor antigenicresp'onseiih' the 

subject. Dosage regima may be adjiisted to provide the optimum therapeutic respo^^ ror 
example, several divided doses may be administered daily' or the dose may be proportionally 
reduced as indicated by the exigencies of the therapeutic' situation. 
5 The active compound (i.e., a peptide having an activity of CsD-f I or £aiLf H) may be 

administered in a convenient manner such as by injection (subcutaneous, intravenous, etc.), 
oral administration, inhalation, transdermal application, or rectal administration. Depending 
on the route of administration, the active compound may be coated in a material to protect the 
compound from the action of enTymes, acids and other natural conditions which may 

10 inactivate the compound. 

To administer a peptide having an activity of Can f I or Caoi" H by other than 
parenteral administration, it may be necessary to coat the peptide with, or co-administer the 
peptide with, a material to prevent its inactivation. For example, a peptide having an activity 
of Can f I or CanJf II may be admmistered to an individual in an appropriate carrier, diluent 

1 5 or adjuvant, co-administered with enzyme inhibitors or in an appropriate carrier such as 

liposomes. Pharmaceutically acceptable diluents include saline and aqueous buffer solutions. 
Adjuvant is used in its broadest sense and includes any immune stimulating compound such 
as interferon. Adjuvants contemplated herein include resorcinols, non-ionic surfactants such 
as polyoxyethylene oleyl ether and n-hexadecyl polyethylene ether. Enzyme inhibitors 

20 include pancreatic trypsin inhibitor, diisopropylfluorophosphate (DEP) and trasylol. 

Liposomes include water-in-oil-in-water CGF emulsions as well as conventional liposomes 
(Strejan sLaL, (1984) J. NeurQimmunoL. I: 27). For purposes of inducing T cell 
nonresponsiveness, the composition is preferably administered in non-immunogenic form, 
e.g., one that does not contain adjuvant. 

25 The active compoimd may also be administered parenterally or intraperitoneally. 

Dispersions can also be prepared in glycerol, liquid polyethylene glycols, and mixtures 
thereof and in oils. Under ordinary conditions of storage and xise, these preparations may 
contain a preservative to prevent the growth of microorganisms. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 

30 solutions (where water soluble) or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or dispersion. In all cases, the composition must be 
sterile and must be fluid to the extent that easy syringability exists. It must be stable xmder 
the conditions of manufacture and storage and mxist be preserved against the contaminating 
action of microorganisms such as bacteria and fimgi. The carrier can be a solvent or 

35 dispersion medium containing, for example, water, ethanol, polyol (for example, glycerol, 
propylene glycol, and liquid polyetheylene glycol, and the like), suitable mixtures thereof, 
and vegetable oils. The proper fluidity can be maintained, for example, by the use of a 
coating such as lecithin, by the maintenance of the required particle size in the case of 
dispersion and by the use of surfactants. Prevention of the action of microorganisms can be 
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achieved by various antibacterial and antifungal agents, for example, parabens, 
chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. In many cases, it will be 
preferable to include isotonic agents, for example, sugars, polyalcohols such as mannitol, 
sorbitol, sodium chloride in the composition. Prolonged absorption of the injectable 
5 compositions can be brought about by including in the composition an agent which delays 
absorption, for example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating active compound (i.e., a 
peptide having an activity of Can f I or CaafU) in the required amount in an appropriate 
solvent with one or a combination of ingredients enumerated above, as required, followed by 

1 0 filtered sterilization. Generally, dispersions are prepared by incorporating the active 

compound into a sterile vehicle which contains a basic dispersion medium and the required 
other ingredients firom those enumerated above. In the case of sterile powders for the 
preparation of sterile injectable solutions, the preferred methods of preparation are vacuum 
drying and freeze-drying which yields a powder of the active ingredient (i.e., at least one 

1 5 peptide having an activity of Can f I or CaiLf H) plus any additional desired ingredient from a 
previously sterile-filtered solution thereof 

When the peptide having an activity of CaiLfl or CaolU is suitably protected, as 
described above, the peptide may be orally administered, for example, with an inert diluent or 
an assimilable edible canier. The peptide and other ingredients may also be enclosed in a 

20 hard or soft shell gelatin capsule, compressed into tablets, or incorporated directly into the 
individual's diet For oral therapeutic administration, the active compound may be 
incorporated with excipients and used in the form of ingestible tablets, buccal tablets, troches, 
capsules, elixirs, suspensions, syrups, wafers, and the like. The percentage of the 
compositions and preparations may, of course, be varied and may conveniently be between 

25 about 5 to about 80% of the weight of the unit. The amount of active compound in such 
therapeutically useful compositions is such that a suitable dosage will be obtained. 

As used herein "pharmaceutically acceptable carrier" includes any and all solvents, 
dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption 
delaying agents, and the like. The use of such media and agents for pharmaceutically active 

30 substances is well known in the art. Except insofar as any conventional media or agent is 
incompatible with the active compound, use tiiereof in the therapeutic compositions is 
contemplated. Supplementary active compounds can also be incorporated into the 
compositions. 

It is especially advantageous to formulate parenteral compositions in dosage unit form 
35 foreaseof administration and uniformity of dosage. Dosage imit form as used herein refers 
to physically discrete units suited as unitary dosages for the mammalian subjects to be 
treated; each xmit containing a predetennined quantity of active compound calculated to 
produce the desired therapeutic effect in association with the required pharma^^ 
The specification for the dosage unit forms of tke ihvention are dictated by and directly 
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dependent on (a) the unique chsuacteristics of th^ actwe.cpmpom 

therapeutic effect to be achieved, and (b) the limitations inherent in the art of compounding 
such an active compound for the treatment of sensitivity in subjects. 

The present invention also provides a composition comprising at least two peptides 
5 having an activity of Can f I or Caof II (e.g., a physical mixture of at least two peptides), 
each having T cell stimulating activity. For example, at least two peptides each having as 
activity of Can f I can be combined or at least two peptides each having an activity of Can fll 
can be combined, or at least one peptide having an activity of Caaf I and at least one peptide 
having an activity of Can f ll can be combined and administered. Alternatively, a peptide 

1 0 having at least two regions, each having T cell stimulating activity (i.e., each region 

comprising at least one T cell epitope) can be administered to an allergic subject. Such a 
peptide can have at least two regions derived firom the same allergen, QsdSI OTQsskflh or a 
combination of Can f I and Can f XL A composition of two peptides or a peptide having at 
least two regions can be administered to a subject in the form of a composition with a 

1 5 pharmaceutically acceptable carrier as hereinbefore described. An amoimt of one or more of 
such compositions can be administered simultaneously or sequentially to a subject sensitive 
to a dog dander allergen to treat such sensitivity. 

The cDN A (or the mRNA which served as a template during reverse transcription) 
encoding a peptide having an activity of Cani^I or CaiLf II can be used to identify similar 

20 nucleic acid sequences in any variety or type of animal and, thus, to molecularly clone genes 
which have sufficient sequence homology to hybridize to the cDNA encoding a peptide 
having an activity of Can f l or Can f II. Thus, the present invention includes not only 
peptides having an activity of £aiLf I or CaiLf II, but also other proteins which may be 
allergens encoded by DNA which hybridizes to DN A of the present invention. 

25 Isolated peptides that are immunologically related to Can f I or Cm! II> such as by 

antibody cross-reactivity or T cell cross-reactivity, other than those already identified, are 
within the scope of the invention. Such peptides bind antibodies specific for the protein and 
peptides of the invention, or stimulate T cells specific for the protein and peptides of this 
invention. 

30 A peptide having an activity of CaiLf I or Caof H (i.e., £aa.f I or £m£ll produced 

recombinantly or by chemical synthesis) is free of all other dog dander proteins and, thus, is 
useful in the standardization of allergen extracts which are key reagents for the diagnosis and 
treatment of dog dander hypersensitivity. In addition, such a peptide is of a consistent, well- 
defined composition and biological activity for use in preparations which can be administered 

35 for therapeutic puijwscs (e.g., to modify the allergic response of a subject sensitive to dog 
dander). Such peptides can also be used to study the mechanism of immunotherapy of Cams 
familiaris allergy and to design modified derivatives or analogs useful in immunotherapy. 

Work by others has shown that high doses of allergen extracts generally produce the 
best results during immunotherapy (i.e., best symptom relief). However, many subjects are 

r 
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unable to tolerate large doses of such extracts due to systemic reactions elicited by the 
allergens and other components within these preparations. A peptide having an activity of 
Can f l or CaiLf II of the invention has the advantage of being free of all other dander protein. 
Thus, such a peptide can be administered for therapeutic purposes. 
5 It is now also possible to design an agent or a drug capable of blocking or inhibiting 

the ability of a dog dander allergen to induce an allergic reaction in sensitive subjects. Such 
agents could be designed, for example, in such a manner that they would bind to relevant 
anti-Caof I or anti-CaO-f n IgE molecules, thus preventing IgE-allergen binding, and 
subsequent mast cell/basophil degranulation. Alternatively, such agents could bind to 

1 0 cellular components of the immune system, resulting in suppression or desensitization of the 
allergic responses to dog dander allergens. A non-restrictive example of this is the use of 
peptides including B or T cell epitopes of Cmf I or CanIII, or modifications thereof, based 
on the cDNA protein structure of Cani I or CanJH to suppress the allergic response to a dog 
dander allergen. This could be carried out by defining the structures of fragments encoding B 

15 and T cell epitopes which affect B and T cell function in in vitro studies with blood 
components from subjects sensitive to dog dander. 

The invention is further illustrated by the following examples which should not be 
construed as further limiting the subject invention.^ The contents of all references and 
published patent applications cited throughout this application are hereby incorporated by 

20 reference. 

V.VAMPT.E 1; Protein S egiiencc Analysis of Purified Can f 1 

Affinity purified Canf l protein was obtained from Dr. Aalberse (de Groot, H. sLsL, 
SUprn). An Applied Biosystems Model 477A gas phase sequencer with on-line 

25 phynylthiohydantoin (HTH) amino acid analysis (Model 120A) was used to sequence the 
purified Canfl protein. A modification of the extraction program, multiple butylchloride 
extractions, was used to improve the amino acid recovery. O-phthaladehyde (OP A) was used 
in blocking of primary amines when proline was located at the amino terminus. Brauer, 
A.W., £LaL, (1984) Anal Rmchemistrv. 121: 134, 142. In situ alkylation was performed by 

30 using the non-nucleophilic reductant, tributylphosphine with concomitant alkylation by 4- 
vinyl pyridine in ethylmorpholine buffer. Andrews, P.C. and Dixon, J.E., (1987) AnaL 

BiQchemistry,16I: 524-528. 

Using this methodology, the sequence of the N-terminus of the Can f I protein was 
determined, contrary to previous reports that the N-terminus is blocked (Schou, C. eLaL, 
35 supra) . The N-terminal sequence of 65 amino acid residues which was identified through 
multiple N-terminal sequence analysis in conjunction with OPA blocking of contaminating 
signal represents a novel protein sequence (Figure 5). The CaiLf I protein sequence was 
cohfirmed'and expanded by sequence analysis of CNBr cleaved peptides. In situ CNBr 
digestion f Caiif l on the sequence glass -filter disk provided additional protein sequence 
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mf^iWo-^f ^"^-^^'^r*'""-'^ ^ ^ and.Nice- RG„.n 984;>iRinchem. International. 8: 787^ - 791 . 
Prior to the in situ CNBr cleavage, forty-four cycles of amino acid sequencing were 
performed and then the protein sample was treated with OPA to block all amino groups. 
After five hours of in situ CNBr digestion, three major sequences were identified 
5 corresponding to the sequences after Met46, Met30 and an unknown Met, later shown to be 
Metl03. An additional OPA block after cycle 18 (before Pro65) extended the sequence to 
Asp86. Sequence analysis of CNBr peptide fragments isolated by HPLC (Applied 
Biosystem, Inc., Model 130, C8 Colunm) further extended the N-terminal sequence to 
ninety-four amino acid residues. In situ CNBr cleavage in conjimction with OPA blocking 

10 also identified a 39 amino acid residue peptide (residues 104-142), A potential N- 

glycosylation site was found in the cDNA deduced amino acid sequence, Asn54 - Ile55 - 
Thr56. The protein sequence analysis identified the Ile55 and Thr56 of CaiLf I, however, 
nothing could be identified at the position 54. This suggests that post-translation 
modification occurs at Asn54 of Qan f I and the modification is stable to the trifluoroacetic 

15 acid treatment during protein sequencing. 

F.YAMPI.E 2z Extr action of mRNA From Canine Parotid Glands 

and Cloning of Can f I 

A pair of firesh parotid glands from a single outbred dog were obtained from the Tufts 

20 University School of Veterinary Medicine (Worcester, MA) and, washed in phosphate 
buffered saline, and inmiediately fit)zen on dry ice. KNA was extracted essentially as 
described in the literature (Chirgwin, J, M. sLaL, (1979) Biochemistry. 1£: 5294 - 5299.). 
One gland was pidverized to a powder with a mortar and pestle frozen in liquid N2, and 
suspended in 25 ml of GTC buffer (50% w/v guanidine thiocynate, 0.5% w/v Na lauryl 

25 sarcosine, 0.7% v/v B-mercaptoethanol, 0.1% v/v Sigma Antifoam A, 25 mM Na citrate, pH 
7.0) and vortexed until dissolved. Genomic DNA present in the solution was sheared by 
forcing the solution through a 16 gauge needle until the viscosity of the solution no longer 
decreased. The sheared solution was centrifiiged at 3 K rpm for 5 minutes at room 
temperature. The supernatant was then sheared further through a 23 gauge needle until its 

30 viscosity no longer decreased, and cleared by centrifrigation at 5 K rpm for 5 minutes at room 
temperature. The solution was layered onto a CsCl cushion (5.7 M CsCl, 10 mM EDTA pH 
7.5) and ultracentrifiiged in a Beckman SW 41 Ti rotor at 35 K rpm for 16 hours at 20*^ C. 
The supernatant was discarded and the RNA pellet washed in 70% EtOH then resuspended in 
0.3 M NaOAc, 10 mM EDTA, 0.1% SDS. Two volumes of absolute EtOH were added, and 

35 precipitation carried out on dry ice. RNA was pelleted by centrifiigation, 70% EtOH washed, 
and resuspended in TES (10 mM Tris, 1 mM EDTA, 0. 1 % SDS), The final yield was -1.8 
mg. 

Single strand total dog parotid gland cDNA was synthesized using the above RNA 
preparation as a template in reverse transcription. 4 ^g of total RNA were EtOH. precipitated 
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(using glycogen as carrier: molecular biology grade Boerhinger Manheim), 70% EtOH 
washed, and rcsuspended in 10 ^il of dH20. Oligo dT(12-l 8) was added to 50 ^g/ml and the 
RN A denatured at 70° C for 5 niinutes. The reaction was quick chilled on ice and 1 ^il (40 
units) of RNAsin (Promega) was added as a prophylactic against contaminating RNases. The 
5 components from the BRL Superscript™ Reverse Transcriptase Kit were added as follows: 4 
^l 5 X buffer, 2 nl 0-1 M DTT, 1 ^1 10 mM dNTP mix. After warming the reaction to 37° C, 
1 ^il (200 units) of Superscript™ Reverse Transcriptase was added, and the reaction allowed 
to proceed for one hour at 37° C. Reverse transcription was terminated by incubation at 70° 
C for 15 minutes, and the reaction stored at -20° C. 
1 0 Initially the MOP AC (mixed oligonucleotides primed amplification of cDN A) 

technique of PGR amplification (Lee, C. C. sLaL. (1988) Sfiienes. 222: 1288 - 1291) was 
used to obtain a partial cDNA of Can f I encoding amino acids 14 to 29 of the mature protein. 
Using dog parotid cDNA as a template with degenerate primer pairs (synthesized on an 
Applied Biosystems 392) based on residues 9 to 15 (SEQ IDN0:3) [Fig.l, SI A (SEQ ID 
15 N0:4) or SIB (SEQ ID NO:5)] and 30 to 37 (SEQ ID NO: 10) [Figure 1, AS2A (SEQ ID 
NO:l 1) or AS2B (SEQ ID NO:12)] of mature Cmlh a DNA fragment of the predicted size 
(--3 X 28 amino acids, or 84 bp) could be amplified using a PGR kit (GeneAmp kit, Perkin 
Elmer Cetus, Norwalk, CT) in conjunction with the following program in an MJ Research 
Minicycler: 40 X (92^* C 30 seconds/55^ C 1 minutes/75*' C 1 minutes). The primer pair 5' 
20 SlB/3* AS2B amplified the predicted fragment with the greatest efficiency, inferring that in 
both coding regions, the leucine residue was encoded by CTX rather than TT(A or G). As a 
test of its authenticity, the amplified fragment hybridized on a Southern blot to an internal 
degenerate oligonucleotide probe Dog probe 1 (SEQ ID N0:7), [based on Canfl residues 17 
to 24, (SEQ ID NO:6)] that had been end labeled with y-32P ATP using T4 polynucleotide 
25 kinase. After subcloning of the amplified fragment into Bluescript KS plasmid vector 

(Stratagene, San Diego, CA), it was sequenced using a Sanger dideoxy termination kit (USB 
Cleveland, OH) and shown to correctly encode residues 16 to 29 of mature £aDLfL 

Similarly, when amino acid sequence analysis of purified Csskll yielded sequence 
information extendmg to residue 94 of the mature protein, new primer pairs were used in 
30 MOPAC PGR amplification of an extended partial Canf l cDNA (residues 14 to 87). The 5' 
or sense primers SA (residues 14 to 20) and SB (residues 21 to 27) were a nested pair based 
on the known Can f I partial cDNA sequence, while the 3' or antisense primers, AS3 A (SEQ 
ID NO:13) (Figure 1) and AS3B (SEQ ID NO:14) (Figure 1) were degenerate 
oligonucleotides based on residues 88 to 94. In sequential rounds of PGR (1/lOOth of the 
35 first reaction was used as template for the second reaction) using conditions described above 
in a pair of successive reactions using nested 5' sense oligos in conjunction with a single 3' 
antisense degenerate primer, a DNA fragment of the predicted size (-3 X 80 amino acids, or 
240 bp) could be amplified. Degenerate3* antisense oligo AS3B was more efficient in 
collaborating with the successive pairs of 5| sense oligos to amplify the partial internal Can f 
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I cDNA than oligo AS3 A, again suggesting that the leucine residue 
rather than TT(A or G). The 240 bp DNA fragment was subcloned into Bluescript KS 
plasmid vector and sequenced as described above. It too proved to be an authentic CaiLf I 
cDNA. The missing residue in the amino acid sequence of Caof 1 at residue 54 was 

5 determined to be an asparagine on the grounds that: 1) no amino acid signal was found at 
residue 54 during protein sequence malysis; and 2) the asparagine residue resides within a 
consensus sequence for N-linked glycosylation (N54 155 T56). These data strongly suggest 
that the N54 residue is modified by N-linked glycosylation. 

To obtain the 3' portion of the CaiLf I cDNA, the RACE (Rapid Amplification of 

10 cDNA ends) PGR protocol was employed (Frohman, M. A. sLaL, (1988) EtosL 

> Tf»tl Acad. Sci. , fiS: 8998 - 9002). First strand cDNA synthesis from total dog parotid RNA 
was carried out as described above, except that the JM3 oligonucleotide was substituted for 
oligo dT as the primer in the reaction. The JM3 primer (SEQ ID NO:22) has an arbitrary 
tract of -40 nucleotides encoded 5' of an oligo dT tract (Figure 2). Hence, upon priming of 

1 5 poly A+ RNA to make cDN A, this known nucleotide tag is covalently linked to the 5* end of 
the nascent cDNA transcripts. Using nested 5' primers, SD (residues 73-79 (SEQ ID NO:15)) 
and SE (residues 80-86 (SEQ ID NO:17)), based on known CanJI cDNA sequence from 
MOP AC PGR analysis and nested primers based on the known JM3 primer sequence (JM3- 1 
(SEQ ID NO;23) and JM3.3 Bam (SEQ ID NO:24)) in PGR amplification as above (except 

20 the PGR program was 40 X [92** G 30 seconds/60*' G 1 minutes/75** G 1 minutes]), a DNA 
fragment --500 bp in length was amplified. When probed against a kinase labeled degenerate 
oligonucleotide. Dog Probe 2 (SEQ ID NO:9) [residues 88 to 94 of mature Cailf 1 (SEQ ID 
NO:8)], this band proved positive for hybridization. Upon subcloning into plasmid vector 
and DNA sequence analysis, three different partial 3' QsnJl cDNAs were identified: Caaf I 

25 (SEQ ID NO:61), 2Can f I (SEQ ID NO:62), and SCaofl (SEQ ID NO:63), each as shown in 
Figure 9. 2 0n f I had a sequence that encoded a methionine residue followed by an 
asparagine-proline pair. These landmark residues for protein sequence analysis predicted: 1) 
a GNBr fragment with the NIC terminal sequence MAKLLGRDPEQ...(SEQ ID NO:64); 2) 
an acid sensitive cleavage site at the DP pan-; and 3) a proline residue which should prove 

30 refractory to OP A treatment and yield amino acid sequence data where all other NH2 termini 
would be blocked by the treatment. Indeed, further protein sequence analysis of purified Can 
f I did identify a GNBr fragment that in conjunction with OPA blockage at the internal 
proline residue had the sequence 

(M)AKLLGRDPEQSQEALEDFOEFSOAKGLNQEILELAQS(E)T (SEQ ID NO:65). Acid 
35 cleavage of the purified protein yielded a peptide with the sequence (D)PEQS(E)EA (SEQ ID 
NO:66), These complimentary data from protein sequence analysis and partial cDNA 
cloning of £an-f I indicated that the authentic 3' end of the Caaf I cDNA may not have been 
isolated. 
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Comparison of the amino acid sequence data from sequencing purified CsnSl and 
those encoded by the partial cDNAs 2Can-f 1 and 3£aaf I inferred the origin of the multiple 
species of 3* cDNAs may have been alternative splicing of the nascent £Iaaf I transcript (note 
how in partial cDNA 3 Canf I residues from the NH2 tenninus of the CNBr fragment are 
5 found linked to the ftagment's COOH terminal residues without the intervening residues). In 
contrast to this hypothesis of multiple cDNAs originating at the level of alternative splicing, 
the above PGR amplification of the 3* end of the Caaf I cDNA produced a single prominent 
DNA fragment --500 bp in length. However, the three partial 3' cDNAs were either 
significantly longer or shorter than 500 bp. This suggested rare partial cDNAs were being 
1 0 subcloned, perhaps because the authentic Qsull cDNA harbored the restrictions site(s) 

encoded at the ends of the primers used in subcloning of DNA fragments that arise from PGR 
amplification. Hence, when digesting the PGR product representing the authentic £ani'I 
cDNA with restriction endonucleases (in this case 5' EcoR I and 3' BamH I) one would 1) cut 
the authentic Can f I cDNA into at least two pieces, and 2) bias towards subcloning rare 
1 5 cDNAs that had arisen firom alternative splicing of the nascent CflaJI RNA transcripts that 
had exons containing EcoR I and/or BamH 1 sites deleted. To address this situation, new 
primers with different restriction enzyme sites at their 5' ends were synthesized and used in 
RACE PGR of the 3' end of the £aXLf I cDNA. The JM3-3 oligo was resynthesized with a 
Bgl II linked to its 5' end [JM3.3XB (SEQ ID NO:21)] (Figure 2), while the 5* primers SD 
20 and SE were resynthesized with Xho I sites at their 5* ends PCSD (SEQ ID N0:1 6) and XSE 
(SEQ ID NO: 18)1 (Figure 2). After nested PGR of JlVD primed total dog parotid cDNA 
using these new primers and the previous amplification conditions, the intact 3' end of the 
Caa_f I cDNA (which hybridized to kinase labeled Dog Probe 4, [(SEQ ID NO:20), residues 
1 15-121 of mature Canf l (SEQ ID N0:19) (Figure 2)] was subcloned and sequenced. The 
25 translated amino acid sequence of the partial cDNA corresponded directly with the protein 
sequence data and extended it a further 6 amino acids before encountering a stop codon. As 
the cloning artifacts had predicted, both EcoR I and BamH I sites were found in the coding 
region of the intact 3' Caaf I cDNA. 

The 5' end of the Can f l cDNA was cloned using an anchored PGR technique (Roux, 
30 K. H. and Dhanarajan, P., (1990) Rintechnigues. g: 48 - 57; Rafiiar, T. £t^, (1991) J, Biol 
Chem. . 266 : 1229 - 1236). Double strand dog parotid cDNA was synthesized using a kit 
(BRL Superscript cDNA Synthesis System) employing the method of RNase H priming of 
the second strand of cDNA synthesis (Gubler, U., and Hof&nan, B, J., (1983) Gene. 25: 263 - 
269). The blxmt double stranded cDNA was ligated to an anchor adapter, thereby placing a 
35 known sequence at the 5' ends of cDNAs (SEQ ID NO:25; SEQ ID NO:26; and SEQ ID 

NO:27) (see Figure 3). A primer based on the anchor sequence was used as a 5' sense primer 
(AP) in conjunction with a nested pair of 3' antisense primers, ASA (SEQ ID NO:3 1 ) 
(residues 1 8 to 24 (SEQ ID NO:30)] and ASB (SEQ ID NO:33) [residues 25 to 30 (SEQ ID 
NO:32)] based on known Caof I cDNA sequence from MOPAC PGR in sequential rounds of 
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"PCR (40*X [92**'C 30 s^^^ l^minutes/75^C l»minutes])ito amplify the 5; end of the 

Canf I cDN A (1 reaction ds anchored cDN A template with 5' AP/3* ASB primers: 2° 
reaction 1/1 00th 1* reaction template with 5' AP/3' ASA primers). Agarose gel 
electrophoresis analysis of the 2° reaction revealed a broad band --300 bp in length, which in 
5 Southem blot analysis hybridized to a 32P kinased degenerate oligonucleotide probe, Dog 
Probe 0 (SEQ ID NO:29) (Figure 3), based on residues 9 to 17 (SEQ ID NO;28) of mature 
CaO-f I- The amplified firagment was subcloned into Bluescript KS plasmid and subjected to 
DNA sequence analysis. It's authenticity as the 5* end of the CaiLf I cDNA was confirmed by 
the presence of the first 13 residues of mature CaoJI protein at the 3' end of the partial 

10 cDNA, Sequence of the longest partial 5* cDNA extended a fiirther 126 bp and encoded a 26 
amino acid leader sequence not found in mature £aai I- Although no in-firame stop codons 
were found 5' of the presumed initiator methionine codon (M-26), it is presumed to be the 
true initiator codon and not just an internal methionine residue because: 1) it is embedded 
within a consensus sequence for translation initiation in manmialian cells (Kozak, M., (1986) 

1 5 Cell , 44: 283 - 292); and 2) the predicted leader sequence is highly homologous to the leader 
sequences of proteins that are highly related to Canf I (see below). 

A contiguous Canf I cDNA was then amplified and both strands directly sequenced 
as a PCR product to confirm the coding sequence of the molecule. To minimize the 
possibility of introducing errors in the amplified cDNA during the PCR reaction, Pfii I DNA 

20 polymerase (Stratagene, San Diego, CA) was used to ampliiy the coding cDNA. Pfu I DNA 
polymerase has been documented to introduce an order of magnitude fewer errors than Taq 
DNA polymerase during PCR (Ltmdberg, K, S., (1 991 ) Qenfi. IM: 1 - 4). Direct sequencing 
of non-cloned DNA fragments fix)m PCR reactions should also obviate any errors made by 
DNA polymerases during PCR since such errors will be scattered at random throughout the 

25 population of PCR products (Gyllensten, U. B., and Ehrlich, H. A.. (1 988) Proc. Natl. Acad. 
Sci. USA. S5: 7652 - 7656), Primers used in the amplification/sequencing included the 5' 
sense leader ex oligo (SEQ ID NO:35) [residues -26 to - 20 of CanJ" I (SEQ ID NO:34)] and 
the 3* antisense stop Bgl 11 oligo (SEQ ID NO:36) [a 24-mer 40 bp 3' of the stop codon of 
Qmll] (Figure 4). A program of 40 X (95'* C 30 seconds/60" C 45 seconds/75° C 45 

30 seconds) was used with the aforementioned primers and Pfii I DNA polymerase to amplify a 
DNA fi:agment --600 bp in length, which was subsequently isolated as a band on a 0.6% low 
melt agarose gel. This gel slice was melted at 70** C and used as template for PCR 
sequencing using 32P labelled oligonucleotides as primers and a commercially available kit 
(AmpliTaq Cycle Sequencing Kit, Perkin Elmer Cetus, Norwalk, CT), A program of 30 X 

35 (95° C 30 seconds/60** or 68** C 30 seconds) was used for the cycle sequencing. The PCR 
sequencing strategy to obtain unambiguous sequence of the mature £aiLf I protein fi-om both 
strands of the amplified cDNA is depicted in Figure 4 with the following sense primers: start 
ex (SEQ ID NO:37); SB (SEQ ID NO:38); SK (SEQ ID NO:39); SE (SEQ ID NO:40); and 
SH (SEQ ID NO:41), and the following antisense primers: Dog 9 (SEQ ID NO:42); ASK 
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(SEQ ID NO:43); ASB (SEQ ID NO:44); and ASJ (SEQ ID NO:45). PGR cycle sequencing 
analysis of amplified cDNA encoding the mature Canf l protein served to confirm the DNA 
sequence obtained previously from cloned partial cDNAs of Caof I, Figure 4. 

In order to infer the possible biological fiinction of QmSh its amino acid sequence 

5 was compared to those in the GenBank, GenBankUpdate, EMBL, and EMBL Update 

sequence data bases (as of 6/25/92) using the NCBI BLAST network service (Altschul, S. F., 
et al.- (1990) J Mol Biol.. 215 : 403 - 410). Caaf I precursor protein (including the signal 
sequence not found in manire CaiLf I protein) displayed strong homology to three proteins: 1) 
Human von Ebner's gland protein; 2) Rat (VEG) von Ebner-s gland protem precursor 

1 0 (Hartwig, S., fiUL, (1 990) NalUie. 342: 366 - 369); and 3) Rat odorant-binding protein 

(Pevsner, J.. sLaL, (1988) Sidfincfi. 241: 336 - 339). von Ebner^s gland is a sublingual gland 
and secretes an abundant protein into the saliva speculated to be involved in potentiating the 
sense of taste involving hydrophobic molecules, von Ebner's gland protein belongs to a 
superfamily of of lipophilic molecule carriers (Godovac-Zimmermann, J., (1988) Irsnds 

1 5 Rinchem. Sci. , H: 64 - 66). The homology between Canf I and the hiunan and rat von 

Ebner's gland proteins indicates that £aiLf I may be the canme homolog of von Ebner's gland 
protein. Additional data indicates that Caaf I mRNA is expressed predominantly in the 
tongue epithelial tissue where von Ebner's glands are localized and only at a very low level 
(not detectable by Northern blot analysis) in parotid glands. 



20 



F.XAMPLE 3 - Protein Scnuence Analysis of Pwrificd Can f II 

Affinity purified Canf II protein was obtained from Dr. Aalberse (de Groot, H. £LaL, 
supra V An Applied Biosystems Model 477A gas phase sequencer with on-line 
phynylthiohydantoin (HTH) amino acid analysis (Model 120A) was used to sequence the 

25 purified Can f II protein. A modification of the extraction program, multiple butylchloride 
extractions, was used to improve the amino acid recovery. O-phthaladehyde (OPA) was used 
in blocking of primary amines when proline was located at the amino terminus. Brauer, 
A.W., et al.. (1984) Rinchemistrv. 121: 134, 142. In situ alkylation was performed by 
using the non-nucleophilic reductant, tributylphosphine with concomitant alkylation by 4- 

30 vinyl pyridine in ethyhnorpholine buffer. Andrews, P.O. and Dixon, J.E., (1987) AfljaL 
Biochemistry. 161: 524 - 528. 

Using this methodology, the sequence of the N-terminus of the Caof II protein was 
determined. The N-terminal sequence of 38 amino acid residues which was identified 
through multiple N-terminal sequence analysis in conjimction with OPA blocking of 

35 contaminating signal represents a novel protein sequence Figure 1 9 (SEQ ID NO: 88). 
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F.X AMPLE 4- F^trflctin n nf mltNA From Canine Parotid Glands. . 

The strategy used to clone CaUufll is schematically drawn in Figure 14. cDNA was 
synthesized using the above preparation as a template in reverse transcription. In the next 
5 step ds cDNA was used as a template for PGR along with degenerate primers which were 
designed based on amino acid isequence of Can f II and oriented to amplify a fragment of Can 
f II cDNA. PGR product was gel purified and than subjected to direct sequencing. The 
nucleotide sequence confirmed that the PGR product represents a fragment of CaiLf H cDNA. 
Further polymerase chain reactions were performed using CaaJfH specific primers in order to 

1 0 obtain a longer fragment which was subsequently used as a probe to screen a dog cDN A 
library. Positive clones were identified, plaque purified, sequenced and fiill length Cm f II 
cDNA was obtained* 

Fresh parotid glands from a single outbred dog were obtained from the Tufts 
University School of Veterinary Medicine (Worcester, MA), washed in phosphate buffered 

15 saline, and immediately frozen on dry ice. RNA was extracted essentially as described in the 
literature (Ghirgwin, J. M. £LaL, (1979) Biochemistry. !&: 5294 - 5299.). Two glands 
(approx. 50 g) were pulverized to a powder with a mortar and pestle frozen in liquid N2, and 
suspended in 25 ml of GTC buffer (50% w/v guanidine thiocynate, 0.5% w/v Na lauryl 
sarcosine, 0.7% v/v b-mercaptoethanol, 0.1% v/v Sigma Antifoam A (Sigma, St. Louis MO), 

20 25 mM Na citrate, pH 7.0) and vortexed until dissolved. Genomic DNA present in the 

solution was sheared by forcing the solution through a 16 gauge needle imtil the viscosity of 
the solution no longer decreased. The sheared solution was ccntrifuged at 3 K rpm for 5 
minutes at room temperature. The supernatant was then sheared further through a 23 gauge 
needle until its viscosity no longer decreased, and cleared by centrifiigation at 5 K rpm for 5 

25 minutes at room temperature. The solution was layered onto a CsCl cushion (5.7 M GsCl, 1 0 
mM EDTA pH 7.5) and centrifiiged in a Beckman SW 41 Ti rotor at 35 K rpm for 1 6 hours 
at 1 T G. The supernatant was discarded and the RNA pellet washed in 70% EtOH then 
resuspended in 0.3 M NaOAc, 10 mM EDTA, 0.1% SDS. Two volumes of absolute EtOH 
were added, and precipitation carried out on diy ice. RNA was pelleted by centrifiigation, 

30 70% EtOH washed, and resuspended in TES (10 mM Tris, 1 mM EDTA, 0.1% SDS). The 
final yield was - 5.3 mg of total RNA, mRNA was isolated from total RNA by 
chromatography on oligo(dT) cellulose using the method described by Aviv, H. and Leder, P. 
rProc. Natl. Acad. Sci. USA . (1972) 62: 1408). 20 ^ig of poly (A) RNA was obtained from 
1.7 mg of total RNA. 

35 The conversion of gland poIy(A) mRNA into double stranded cDNA was carried out 

using standard procedure (Ausubel et al.; (19Q^> 'Current Protocols in Molecular Biolopv. 
John Wiley & Sons). First, poly(A)RNA was copied onto cDNA was using Amersham 
cDNA Synthesis System Plus according to the manufacturer's procedure. 4 |ig of poly (A) 
RNA was used as a template and oligo dT(12-l 8) was used to prime first strand synthesis. 
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The RN A in RNA/DNA hybrid was than removed by RNaseH and the second strand was 
synthesized by DNA Polymerase I. Double stranded cDNA was completed and made blunt 
by T4 DNA polymerase and E, coli DNA ligase according to Gubler, U. and Hoffman, BJ., 
(1983)£isnfi,25:263), 

5 Initially, PGR amplification (Mullis, K.B. and Faloona, F., (1 987) Methods EnTivmol. 

}55 : 355 - 360) was used to obtain a partial cDNA of CaiLf H encoding amino acids 16 to 29 
(SEQ ID NO: 97) of the mature protein. Using dog parotid cDNA as a template with 
degenerate primer pairs (synthesized on an Applied Biosystems 392) based on residues 3 to 6 
(SI A) (Figure 13) (SEQ ID NO: 91) and (SIB) (Figure 13) (SEQ ID NO: 92) and on residues 

10 33 to 38 (ASP2A) (Figure 13) (SEQ. ID NO: 93) and (ASP2B) (Figure 13) (SEQ ID NO: 94) 
of mature Canf II, a DNA fragment of the predicted size (-1 20 bp) was amplified using a 
PGR kit (GeneAmp kit, Perkin Elmer Cetus, Norwalk, CT). Conditions for the reaction 
were: denaturation for 1 minute at 94°C; annealing for 1 minute at 42X and polymerization 
for 1 minute at 72°C. The cycle was repeated 30 times. As a test of its authenticity, the 

1 5 amplified fragment was subjected to direct sequencing using a commercially available kit 
(AmpliTaq Cycle Sequencing Kit, Perkin Elmer Cetus, Norwalk, CT) according to the 
instructions supplied. Primers used in the amplification/sequencing (which included SI A, 
SIB, ASP2A and ASP2B) had been end labeled with Y-32P ATP using T4 polynucleotide 
kinase. The following program of 19 cycles (denaturation at 95^C for 1 minute; annealing 

20 at 50°C for 1 minute and extension at 72'*C for 15 seconds) was used for the cycle 

sequencing in a MJ Research Minicycler. The nucleotide sequence of about 40 nucleotides 
of the fragment was shown to correctly encode residues 16 to 29 of mature CaiLf II (SEQ ID 
NO: 97). The missing residue in the amino acid sequence of a native protein at the position 
26 was found to be asparagine. 

25 In order to generate a CaO-f II specific probe long enough (>1 OObp) to be used to 

screen a cDNA library, the 5' and 3' ends of the Caaf II cDNA were cloned using an 
anchored PGR technique (Roux, K. H. and Dhanarajan, P., (1990) Biotechnigues, i: 48 - 57; 
Rafhar., T, sUL, (1991) T Rinl.Chem., 2fifi: 1229 - 1236) (Figure 15), Double stranded dog 
cDNA was synthesized as described above. The blunt double stranded cDNA was than 

30 ligated to an anchor adapter AT/AL (Figure 16; SEQ ID NO: 96 and 1 02) thereby placing a 
known sequence at 5' and 3* ends of cDNAs (Figure 15A). In order to obtain the 5* end of 
Can f II cDNA, a primer based on the anchor sequence AP2 (Figure 16) (SEQ ID NO: 95) 
was used as a 5' primer in conjunction with 3* antisense primers, D2-1 (residues 22 to 30) 
(Figure 13 and 16) (SEQ ID NO: 74), D2-2 (residues 17 to 25) (Figure 13 and 16) (SEQ ID 

35 NO: 75) and D2-3 (residues 16 to 21) (Figure 13 and 16) (SEQ ID NO: 76) based on known 
CaiLf n cDNA sequence obtauied from initial PCR Sequential rounds of PGR (40 X [92** C 
30 seconds/60** C 1 minute/75** C 1 minute]) were carried out to amplify the 5* end of the Can 
,f II cDNA. In the 1°. reaction, double stranded anchored cDNA was used as a template along 
withS' AP2/3* p2-l. primers; in the 2 creaction.l/20th of the 1** reaction mixture was used 
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with 5' AP2/3* D2-2 primers; in the 3° reaction l/20th of 2^ reaction mixture was used with 
AP2/D2-3 primers). 1% agarose gel electrophoresis of the reaction products revealed the 
presence of a single band of 300 bp long. As expected from the position of primers (see 
Figure 1 5), the 2° and 3° reactions products migrated faster than 1 ^ reaction product. The 
5 amplified fragment from the 3** reaction was gel purified and subjected to DNA sequence 
analysis. It's authenticity as the 5* end of the Caofll cDNA was confirmed by the presence 
of the first N-terminal residues of mature Caof U protein (Figure ISA, shaded residues). The 
5* portion of cDNA (Figure ISA) (SEQ ID NO: 98) encoded part of the amino acid signal 
sequence which was not foxmd in mature £aaIII' The 3' portion of Caaf H (Figure 15B) 

10 (SEQ ID NO: 99) was synthesized in an analogous manner as the 5' end except that single 
stranded cDNA was used as a template and APA (Figure 16) (SEQ ID NO: 100) was used as 
a 3' primer and D2.4,(SEQ ID NO: 77), D2-5 (SEQ ID NO: 78) and 02-6 (SEQ ID NO: 79) 
(Figures 14 and 16) were used as internal primers in PCR. Direct sequencing of the PGR 
product from the 3** reaction revealed the presence of 8 amino acids of the known CaaX II 

1 S sequence followed by 8 amino acids downstream of the known sequence. 

In order to clone the full length Caaf II cDNA, a cDNA library was prepared and 
screened using standard published procedures (Gubler and Hoffrnan, Ausubel at al., supra). 
The lambda cDNA library was custom made by Clontech Laboratories, Inc. as follows: the 
first strand cDNA was primed from poly(A) RNA by oligo d(T)15. The blunt ended double 

20 stranded cDNA was ligated to an Eco RI linker CCGGAATTCCGG (SEQ ID NO: 101), 

digested with EcoRI, size selected in order to obtain fragments larger than SOObp, and ligated 
into EcoRI cut and dephosphotylated vector A.gt 10. The DNA was then packaged into 
lambda particles, plated on C600*hfl and C-600 E.Coli strains and the library titer was 
determined. The unamplified library consisted of 1 .S3xl06 mdependent clones (clear 

25 plaques on C-600 hfl host) which contained inserts ranging in size from 0.6kb to 3-4kb. The 
average size of the insert as determined by PCR using Clontech Xgtl 0 primers was 1 .2 kb. 
100,000 clones were plated on C-600hfl host and screened using CaoJ II specific probe. All 
manipulations leading to the cloning and sequencing of £laaXII cDNA weie done according 
to Protocols in Molecular Biologv (Ausubel filJiL, filipta). A CaiLf II probe was obtained by 

30 PCR amplification of dog cDNA using D2.9 (SEQ ID NO: 80) and D2-13 (SEQ ID NO: 83) 
primers (Figure 16). The PCR product was then 32p labeled by random priming. 20 positive 
clones were plaque purified, phage DNA was extracted from individual clones, digested with 
EcoRI and subcloned into pUCl 8. The presence of inserts was verified by digestion of the 
plasmid DNA with EcoRI and the nucleotide sequence of three individual clones was 

3S determined using Sequenase (United States Biochemicals) and AmpliTaq Cycle Sequencing 
kit (Perkin Elmer Cetus, Norwalk CT) according to manufacturer's instructions. PCR cycle 
sequencing analysis served to resolve some DNA sequence ambiguities resulting most 
probably from the formation of secondary structures on GC-rich CaaXH template. The 
sequencing strategy is depicted in Figure 17. Primers used in the sequencing/amplification 
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included commercially available 16-mer Reverse Sequencing Primer (-21) and 17-mer 
Sequencing Primer (-20) from New England BioLabs as well as the £anjf II specific primers 
listed in Figure 16. 

The nucleotide sequence of the three clones revealed the presence of an open reading 

5 frame which included 38 N-terminal amino acid residues (amino acids 1 to 38 on Figure 
13)(SEQ ID NO: 88) of mature Can f II identified earlier by protein sequencing and PGR 
sequencing of partial cDNA (see above). Sequencing strategy and features of three cDNA 
clones la, Ic and Ij are shown on the Figure 17. Clone Ic of 791 bp (SEQ ID NO: 67) 
encodes the fiill length Can f II precursor protein (including signal sequence) and contains 5' 

10 (bases 1 through 194) and 3* (bases 738 trough 791) imtranslated regions. Clones la of 793 
bp (SEQ ID NO: 69) and 1 j of 774 bp (SEQ ID NO: 71 ), encode precursor CaiLf II proteins 
in which part of the signal sequence is missing and contain 3' untranslated regions which are 
longer then in Ic. The sequence alignment revealed a polymorphism among three clones 
(Figure 18), The nucleotide sequence of 1 a contains one nucleotide substitution (C to T at 

1 5 the position 607) and one deletion (at the position 752) compared to 1 c (Figure 1 8). The 
nucleotide sequence of Ij contains two nucleotide substitutions compared to la and Ic at 
positions 347 (T to C) and 401 (G to T). In addition, the sequence of clones la and Ic differ 
significantly at their 5* and 3' ends. The G to T substitution at the position 401 changes the 
predicted amino acid sequence of CaiLf II at residue 68 from glycine (GGC) to valine 

20 (GTC). All other nucleotide changes do not alter the amino acid sequence of CaiLf II since 
they are either silent mutations or they lie outside of the coding sequence of mature Cani II. 
The polymorphism among the cDNA clones may reflect the expression of Caul II genes 
from different alleles. It may also represent a cloning artifact due to the reverse trancriptase 
mediated synthesis of cDNA which may introduce errors (Holland sLal., (1982) Science, 

25 211: 1577-1585). For example, the purified HIV-1 reverse transcriptase was found to 

introduce misincorporations at a rate of 1/2000 to 1/4000 (Preston fitM-, (1988) Sqignce. 242: 
1 168-1 171), It is also possible that the formation of secondary structures on the GC-rich Can 
f II mRNA template may cause pausing of reverse transcriptase or abnormal termination of 
the synthesis. 

30 The predicted sequence of Can f II protein shown in Figure 1 8 (SEQ ID NO: 68) 

contains a 19 amino acid signal sequence encoded by base 195 through base 251 of the 
cDNA shown in Figure 1 8 (SEQ ID NO: 67). This signal sequence is not found in the 
mature CaiLf II protein vs^iich is encoded by bases 252 through 734. The methionine codon 
at the position -19 is true initiator methionine codon and not just an internal methionine 

35 residue because: 1) the predicted amino acid sequence of a signal peptide (residues -19 to -1) 
is highly similar and identical in length to the signal sequences of proteins that are related to 
Can f II (see below); and 2) although another in-frame methionine codon is found 5' of the 
presumed initiatior metioriirie (position -53) it is unlikely to be true because the deduced 
anuno acid sequence of a peptide starting at the residue -53 is much longer than any known 
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signal sequence and does not show any similarity to any kno>vn signal sequence. The Can f 
II cDNA encodes a protein having a predicted molecular weight of 1 8.2 kDa, with a single 
potential N-linked glycosylation site. Because 1) no amino acid signal was found at residue 
25 during protein sequence analysis, and 2) the asparagine residue resides within a consensus 
5 sequence for N-linked glycosylation (N26 K27 S28), these data strongly suggest that the N26 
residue is modified by N-linked glycosylation, N-linked glycosylation may increase the 
molecular weight of the mature protein. The deduced amino acid sequence of the mature 
protein encoded by the nucleic acid sequence is identical to the known NH2-terminal and 
internal amino acid sequence determined by amino acid sequence analysis of purified £anJ[ H 

10 protein conducted as described in Example 3. 

The expression of Can f II in various tissues was studied using the Northern blot 
technique. Poly (A) RNA or total RNA from various tissues was separated by 
electrophoresis through a 1 .5% agarose gel containing 2.2 M fomialdehyde. (Ausubel sLaL, 
supraV After electrophoresis, the separated RNAs were transferred onto GeneScreen 

1 5 membrane (NEN). Transfer, hybridization with a 32P labeled £aiLf II probe [obtained by 
PGR mediated ampUfication using D2-9 (SEQ ID NO: 14) and 02-13 (SEQ ID NO: 83) 
(Figure 16) primers] and washings of the filter were performed according to the 
manufacturer's instructions. It appeared that the QmlH probe hybridized specifically at high 
stringency to RNA from dog parotid gland and to RNA from tongue epithelial tissue (Figure 

20 20). It did not hybridize to RNA from liver, or submaxiliary gland. Hybridization was 
observed to two bands of about 800 bp and 900 bp long, suggesting that CaafU may be 
encoded by two mRN A species. It is unlikely that two RNAs are transcribed from two 
dififerent genes since a Southern blot experiment suggested that only a single copy Can f II 
geneis present in the dog genome. The two mRN As encoding CaLf II may be due to 

25 alternative splicing or to degradation of the mRN A. The former possibility seems very likely 
since different splicing configurations in the 3' noncoding region has been described for 
proteins which are similar to QmlU (Clark et al„ (1 984) RMBO J.. 2: 1045 - 1052, see also 
below). 

In order to infer the possible biological function of CaiLf II, its amino acid sequence 
30 was compared to those in the GenBank, GenBankUpdate, EMBL, and EMBL Update 

sequence data bases tising the NCBI BLAST network service (Altschul, S. F., et al., (1990) L 
Mol BioL. 215 : 403 - 410). Can f II precursor protein displayed high similarity to two groups 
of related proteins: 1) Mouse Urinary Proteins (MUPs) (Figure 21) (SEQ ID NO: 90) and 2) 
urinary a-2-globulins of rat (A2U) (Figure 21) (SEQ ID NO: 89). The sequences of MUPs 
35 and A2Us show them both to be members of the lipocalin protein family (Cavaggioni et al.. 
(1987) EESS-LetL, 212: 225 - 228). These are small proteins capable of binding 
hydrophobic molecules with high afifmity and selectivity. This family now contains over 20 
different proteins, principally identified through sequence homology (Flower £LaL, (1 991) 
Binchim. Binphvs. Res. Commun. . 180 : 69 - 74). The function of MUP and A2U remains 
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unclear, but it is proposed that rodent urinary proteins are responsible for binding 
pheromones and their subsequent release from drying urine (Bocskei et al.. (1 992) Mature, 
360 : 1 86 - 188). They are synthesized at different levels in the liver and in the submaxillary, 
lachrymal, sublingual, parotid and mammary glands (Shahan fiUdL, (1987) Mol. Cdl. Bioln < 
5 7: 1947 - 1954). MUP IV for example, is expressed predominantly in the lachrymal and 
parotid glands, but not in liver (Shahan et al. , aipEa). The amino acid similarity of £mLf H, 
MUP and A2U as well as their pattern of expression may indicate that CanJH is canine 
homolog of lipocalins- Interestingly, immunologic and biochemical studies of MUPs and 
MUP-related proteins have shown that these proteins are important human allergens (Lonisso 
10 fiUL, (1986) 1 Allegrv CHn. TmmunoL> 2£: 928; Platts-Mills sLaL, (1987) J, Aligrgy Clin, 
ImmunoL, 22: 505; Gurka sLaL, (1 989) J. Allgrgv Clin, Immunpl, fi2: 945 - 954). 

y.VAMPT.E 5; B fletftHal RTpression of Can f 1 

Bacterial expression of Can f I was performed as follows. The vector pETl Id AHR 

1 5 His6 (Novagen, Madison, WI; modified at ImmuLogic Pharmaceutical Corporation by J. P. 
Morgenstem) was modified for expression of £anJf I in E. coli, by removal of the internal 
EcoR I restriction site (at residues E143F144) from the CaiLf I cDNA to be inserted in the 
vector. This modification was necessary since all DNA fragments in this vector are cloned in 
fi^e with the His6 NH2 terminal leader sequence at a mutual 5* EcoR I site. Hence, EcoR 1 

20 sites internal to the insert must be avoided. The pETl Id AHR His6 vector also requires that 
inserts have a 3' BamH I site. However, since restriction sites such as Bgl II and Bel I are 
compatible with BamH I overhangs, they could be placed at the 3' end of the Caof I cDNA, 
avoiding the need to mutate the internal BamH I site. A cDNA encoding the mature Can f I 
protein had its internal EcoR I site removed, a imique EcoR I site placed at its 5' end, and a 

25 Bgl II site placed at its 3' end in a two step PGR reaction (Ho fiLflL, SUCm) using Pfu I DNA 
polymerase to minimize errors during amplification (Figure 6). Two halves of the Cw fl 
cDNA were amplified in primary PCR reactions (template: PGR fragment from cycle 
sequencing, program: 40 X [95° 30 seconds/60° C 45 seconds/75° C 45 seconds]) with the 5' 
portion of the molecule being amplified with the Start ex (SEQ ID NO:46 and SEQ ID 

30 NO:47)/EF antisense (SEQ ID NO:50 and SEQ ID NO:48) primer pair and the 3' portion 
amplified with the sense EF (SEQ ID NO:49 and SEQ ID NO:48)/TAG Bgl II (SEQ ID 
NO:51 and SEQ ID NO:52) primer pair. Both EF primers were designed to introduce a point 
mutation in the EcoR I site at residues E143F144 of CflnJTI from GAATTC to GAGTTG, 
which would maintain the El 43 residue since glutamate can be encoded by GAA or GAG 

35 codons. 

Amplified DNA fragments of the ejcpected size were isolated in gel slices from a 
0.6% low melt agarose gel, melted at 70"' C, mixed and used as template in a secondary (2°) 
PCR reaction with Start ex and TAG Bgl 11 primers, Mutagenized regions bearing the 
' E143F144 pair should hybridize in the initial stages of the reaction to link the 5* and 3' ends 
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of the Can f I cDNA, while the extreme 5* an<i 3' primers should serve to amplify the intact 
mutagenized cDNA. The entire reaction was phenol/chloroform extracted, EtOH 
precipitated, 70% EtOH washed, and digested with EcoR I and Bgl II. A band of the 
expected size (-450 bp) was isolated as a gel slice from a 0.6% low melt agarose gel, melted 
5 at 70* C, ligated at room temperature to EcoR I/BamH I digested pET 1 IdAHR His 6 

plasmid, and the ligation transformed into XL-1 bacteria (Stratagene). Miniprep analysis of a 
3 ml culture of one of the transformed colonies (using a Qiagen [Foster City, CA] plasmid 
mini kit) by Eco RV digestion revealed the presence of an insert of the appropriate size 
within the expression vector. A 300 ml culture seeded with this colony was grown, plasmid 

1 0 DN A extracted (Qiagen plasmid midi kit) and subjected to DNA sequence analysis. The 
entire 453 bp insert was shown to have the correct sequence for mature Can f I cDNA 
(including the mutated E143 codon from GAA to GAG), with the addition of an in-frame 
His6 reporter group (SEQ ID NO: 53) encoded at its 5' end. This His6 reporter group was to 
be used in metal ion affinity purification of the recombinant protein using NTA Ni-H- 

15 chelating resin (Qiagen; Hochuli £LaL, 2yi2£a). 

A single colony of BL21(DE3) pET 1 Id AHR His6£an-f IdRI bacteria was inoculated 
into a 2 ml brain heart infrision (BHI) culture (+200^g/ml ampicillin) and incubated at 37° C 
until turbid but not saturated. At this point 6 \il was removed and added to 600 (xl of BHI and 
mixed. 100 \xl was spread onto each of 6 BHI agar plates (+200 ^g ampicillin) and incubated 

20 overnight at 37° C. The next morning the bacterial lawn was scraped off of the plates» pooled 
and resuspended in 20 ml of BHI media, and then aliquoted 1 ml each into each of 1 8 500 ml 
. BHI cultures (+200 |ig/ml ampicillin) in 2 liter Ehrlenmeyer flasks. Cultures were incubated 
at 37° and shaken at 300 rpm until the A600 reached 1 .0. Isopropyl-B-D- 
thiogalactopyranoside (IPTG) was than added to final concentration of 1 mM to induce 

25 expression of the T7 RNA polymerase gene which would in turn induce expression of 
His6 Can f I protein frx)m the hybrid T7 gnlO/lac 0 promoter. Expression vvras allowed to 
proceed for 2 hours after which the bacteria were pelleted and resuspended in 6 M guanidine 
hydrochloride (GuHCl), 100 mM NaP04, 10 mM Tris, 1 00 mM 2-mercaptoethanol pH 8.0. 
Extraction was carried out for 1 hour with vigorous shaking and terminated by pelleting of 

30 the insoluble material at 1 0 K rpm in a JA-1 0 rotor (Beckman) for 1 hour. Supernatant was 
removed, and its pH adjusted to 8.0 before loading onto a 50 ml NTA agarose column that 
had been equilibrated in 6 M GuHCl, 100 mM NaP04, 10 mM Tris, pH 8.0. The column 
wais washed by step gradient as follows: 1) 6 M GuHCl, 100 mM NaP04, 10 mM Tris, pH 
8.0, 2) 8 M urea, 1 00 mM NaP04, 1 0 mM Tris, pH 8,0, 3) 8 M urea, 1 00 mM NaOAc, 1 0 

35 mM Tris, pH 6.3 vnth each wash proceeding until the A280 of the effluent from the column 
reached background. Recombinant His6£aiLf I protein was eluted from the colunm with 8 M 
urea, 100 mM NaOAc, 10 mM Tris, pH 4.5. Yield of the pooled peak fractions was -100 mg 
with a purity of -80% as determined by densitometry of a sample of the material analyzed by 
SDS-PAGE. 
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K coll transformed with the vector pETl Id containing the nucleic acid encoding Can 
f I have been deposited with the ATCC at accession number 69167. 

EXAMPLE 6: Ma mmalian Expression of Can f 1 Protein 
5 To produce a possibly glycosylated form of recombinant Can f 1 protein expessed in 

mammalian cells, £aiLf I expression was carried out as follows. Full length CmJI protein 
(including the leader sequence not found in the mature protein) when expressed in 
mammalian cells should be properly folded, glycosylated, and secreted. Two systems for 
high level transient expression of recombinant CanJt I were employed. First, transient 

10 expression of recombinant Canf I with a His6 reporter group fused to its COOH terminus 
was performed in NIH 3T3 cells using the pJ7Q expression vector (Morgenstem, J. P. and 
Land, H., (1990) Nuc. Acids Res,. 1068). pJVQ drives expression of genes inserted into 
its polylinker to high levels during transient transfection from its SCMV IE94 promoter 
(Morgenstem and Land, supta)* 

15 A cDNA encompassing the entire Canf I coding sequence was amplified using Pfu I 

mediated PGR of total dog parotid cDNA with the 5' Kozak leader (SEQ ID NO:54)/3' TAG 
Bgl II (SEQ ID NO:51 and SEQ ID NO:52) primer pair (see Figure 7). The entire PGR 
reaction was phenol-chloroform extracted, EtOH precipitated, 70% EtOH washed and 
digested with Xho I and Bgl II to generate correct overhangs for insertion into pJ7Q. A band 

20 of the expected size (-600 bp) to encode the entire CanJf I cDNA was isolated as a gel slice 
on a 0.6% low melt agarose gel, melted at 70° G and ligated to Sal I/Bgl II digested pJ7n at 
room temperature (see Figure 7). The ligation was transformed into competent XL-1 Blue E. 
coli and positive colonies selected on ampicillin (200 ^g/ml) dishes. DNA sequence analysis 
of the 5' and 3* ends of inserts was performed on plasmid obtained from 3 ml cultures of two 

25 colonies (using a Qiagen plasmid mini kit). Both plasmids had inserts with the correct 
sequence of the 5* and 3' ends of full length CanJfL 

Next, to aid in purification of recombinant £aiLf I protein produced in mammalian 
cells (Jankecht, R., sLaL, (1991) Proc. Natl. Acad. Sci. USA, fii; 8972 - 8976), a His6 
reporter group was to be fused at its GOOH terminus. This was accomplished by excising 

30 the DNA fragment encoding the GOOH tenninusofCailfl as an EcoRI- Bgl II fragment 
and exchanging it with an EcoR I -Bgl II fragment encoding the GOOH terminus of the 
protein that had been modified with the addition 6 histidines (Figure 8). The GOOH terminal 
His6 DNA fragment was generated by PGR of overlapping synthetic oligonucleotides as 
follows: a sense oligonucleotide (SEQ ID Np:56) encoding residues E123 to QMS of the 

35 mature Canf I protein (SEQ ID NO:55); Sense 3* His6 link (SEQ ID NO:57); an antisense 
oligonucleotide encoding residues E141 to Q148/a His 6 tract/stop codon (SEQ ID NO:50 
and SEQ ID NO:59); and 3' His6 TAG Bglll (SEQ ID NO:60), were synthesized and purified 
by OPG column chromatography (Applied Biibsystems, Foster Gity, GA). In addition, 
smaller primers composed of the first 24 nucleotides of the aforementioned oligonucleotides, 
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5' His6 link and 3* His6 link, were also synthesized, Linldng and amplifying the two long 
oligonucleotides to generate the EcoR I - Bgl II DNA fragment encoding the£anSl COOH 
tenninus-His6 fusion was performed by PCR, 10 pmoles of each large oligonucleotide were 
used as substrate in Pfu I mediated PCR with 1 jiM primers using the program 40 X (95** C 
5 30 seconds/eO** C 45 seconds/75** C 30 seconds). The entire PCR reaction was phenol- 
chloroform extracted, EtOH precipitated, 70% EtOH washed and digested with EcoR I and 
Bgl II to generate correct overhangs for insertion into pJ7n Cmf L A band of the expected 
size (-1 10 bp) to encode the Canf I COOH tenninus/His6 fusion was isolated as a gel slice 
on a 2.0% NuSieve agarose gel, melted at VO'* C and ligated to EcoR I/Bgl II digested pJ7n 

10 Canf I at room temperature. The ligation was transformed into competent XL-1 Blue E. coli 
and positive colonies selected on ampicillin (200 jig/ral) dishes. Plasmid was isolated from 4 
cultures inoculated with different colonies and subjected to DNA sequence analysis at the 3' 
end of the insert. Clone #3 contained the expected His6 residues linked in frame to the 
COOH terminus of Canf L so a large scale grow of this culture was xmdertaken to obtain 

1 5 large quantities of the pJ7Q Can f l His6 plasmid for transfection. A one liter culture was 
amplified in 15 \iM chloramphenicol once A600 reached 0.6. 800 jig of plasmid were 
isolated after alkaline lysis and two successive rounds of CsCl banding (Sambrook sLaL> 
si^ua). 

Ten plates of NIH 3T3 cells were seeded at a density of 1 ,7 X 106 cells per 15 cm 

20 tissue culture dish, and the following morning, subjected to calcium phosphate transfection 
with 20 KLg/dish of pJ7n Cailil His6 plasmid (Parker, B* A., and Stark, G. R., (1979) 
J. Virol,. 21: 360 - 369). 48 hours post transfection, supernatant was pooled from the dishes, 
filtered through a 0.45 ji unit (Costar), and brought to a concentration of 1 nM imidazole, 
with the addition of protease inhibitors 1 mM PMSF, 1 ^ig/ml, pepstatin 1 fig/ml soybean 

25 trypsin inhibitor, and 1 ng/ml leupeptin. Metal ion affinity purification of the Canf IHis6 
protein was achieved by loading the supernatant onto a 2 ml NTA agarose (Qiagen) colxmui 
that had been equilibrated in 1 X PBS, 1 mM imidazole, 1 mM PMSF, 1 ng/ml pepstatin, 1 n 
g/ml soybean trypsin inhibitor, and 1 ng/ml leupeptin. Non-specifically bound protems were 
washed off the column with 10 column volumes of 1 X PBS, 20 mM imidazole, ImM PMSF, 

30 1 ^g/ml pepstatm, 1 \ig/m\ soybean tiypsin inhibitor, and 1 ^g/ml leupeptin. Cani"! His6 

protein was specifically eluted from the column in 1 X PBS; 80 mM imidazole, 1 mM PMSF, 
1 (ig/ml pepstatin, 1 fig/ml soybean trypsin inhibitor, and 1 ^g/ml leupeptin (Hoffinann, A. 
and Roeder, R- G., (1991) Nnc. Acids, Res., 12: 6337 - 6338, and Janknecht £UL, gypra). 
Aliquots of the eluted fractions were analyzed by 12% SDS PAGE. Coomasie blue staining 

35 of the gel revealed three major bands of molecular weight -70 kDa, 45 KDa, and 25 KDa. 

Since the molecular weight of native immimoaffinity purified Qmll is 25 KDa (Schou et al., 
supra and de Groot et al., supm) it was suspected that the smallest band on the SDS gel was 
recombinant Caaf I His6. 
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RVAMPT.V. 7: Dir ^rt RindinP of Human TgR to Recombinant Can f T 
hy FXISA and Western blot 
An ELISA plate (IMMULON II Dynatech, Chantilly, VA) was coated with 
bacterially expressed recombinant £)aaX I, (r£aiLf I) at 0.5 ^g/weil in PBS-Tween and 
5 incubated overnight at 4°C. The coating antigen was removed and the wells were blocked 
with 0.5% gelatin in PBS, 200 nl/well for two hours at room temperature. Plasma from a 
skin test positive dog allergic patient, #901, was serially diluted with PBS-Tween and 100 ^1 
was added per well and incubated overnight at 4*^0 (the plasma dilutions were tested in 
duplicate). The second antibody (biotinylated goat anti-human IgE 1 :1000, Kirkegaard & 

10 Perry Laboratories) was added at 100 nl/well for one hour at room temperature. This 
solution was removed and streptavidin-HRPO at 1:10,000, (Southem Biotecnology 
Associates, Inc. Birmingham, AL) was added for one hour at room temperature, TMB 
Membrane Peroxidase Substrate system (Kirkegaard & Perry Laboratories) was freshly 
mixed and added at 100 jil. The color was allowed to deyelop for 2-5 minutes. The reaction 

1 5 was stopped by the addition of 100 nl/well of IM phosphoric acid. The plate was read on a 
Mircoplate EL 3 1 0 Autoreader (Biotek Instruments, Winooski, VT) with a 450 nm filter. 
The absorbance levels of duplicate wells were averaged. The graphed results are shown in 
Figure 1 1 . The data shows that patient #901 has a high level of anti-Cani" specific IgE such 
that at 1/162 dilution (the highest plasma dilutin used) the binding level to the recombinant 

20 Canf I is still two-fold above background. A known negative patient (#250) was also tested 
and shown to be negative by this assay. 

Western Blot analysis of four different protein preparations as potential sources Canf 
I was performed. The four different prepparations used for Western blotting were: dog hair 
extract, dog saliva, bacterially expressed rCaof I (used for the ELISA) and iCaaf I as 

25 expressed in a mammalian cell culture system. These preparations were loaded on a 15% 
acrylamide SDS-PAGE (lanes 1-4, respectively) at 5 ^ig/lane. The protein concentrations 
were based on the Bicinchoninic acid (BCA) assay (Pierce, Rockford, IL). Following 
electrophoresis, the proteins were transferred to nitrocellulose and stained with India Ink. 
The nitrocellulose sections were blocked by incubation in Tween solution with 1% milk/1% 

30 BSA for 30 minutes room temperature, then probed with patient #901 plasma or negative 

control patient #250 at a 1 :20 dilution in tween mild solution. This first antibody incubation 
was carried out overnight at room temperature. Biotinylated goat anti-human IgE (KPL) was 
used as the second antibody at a 1 :5000 dilution for a two hour incubation. Streptavidin- 
HRPO (1 :20,000 dilution) and the ECL Western Blot Detection system (Amersham, 

35 Arlington Heights, IL) were used for detection by chemiluminescence. A 20 second 
exposure was performed and the film developed. The results from this assay show no 
recognition of the protein preparations by patient #250 IgE. The IgE fi-om dog allergic 
patient #901 shows distinct binding to Caouf I proteins in the saliva and the bacterially 
expressed recombinant Can f I (lanes 2 and 3, Figure 12). The sizes of the protein forms are 
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diflferent between the two preparations aiid this is due to fact that the native Canf I protein 
found in dog saliva is glycosylated and runs with an apparent molecular weight of 28,000 
daltons whereas the recombinant form from bacteria has no carbohydrate modification. The 
binding of IgE from serum of patients #90 1 to the mammalian expressed rCaoi I is 
5 extremely faint and at present is only suggestive of positive expression. The full length 
bacterially produced form, lane 3, has an apparent molecular weight of 18,000 daltons and 
the larger IgE binding proteins in both lanes 2 and 3 are most likely dimeric structures of the 
lower molecular weight proteins. 

10 fVAMPI^E 8 ; Hflgferia! E\nrcssion of Can f II 

In an attempt to readily produce l^e amounts of pure recombinant CaCLf II protein, 
expression nf Canf II in bacteria was canried out as follows. Full length cDNA encodmg 
mature CaoIU protein was obtained by amplification of the molecule from total parotid 
cDNA m PGR using D2-3pet (SEQ ID NO: 103) and D2.5pet (SEQ ID NO: 104) primer pair 
1 5 (Figure 1 3). Primers were designed to introduce EcoRI and BamHI restriction sites at 5' and 
3' of cDNA molecule respectively. The pETl Id AHR His6 vector requfa^s that inserts have 
5' EcoRI site and 3' BamHI site. 

Amplified DNA fragment of the expected size was purified by electrophpresis in low 
melting agarose and ligated at room temperature to EcoRI/BamHI digested pETl Id AHR 
20 His6 plasmid and the ligation mixture was used to transform XL-1 Blue bacteria (Stratagene). 
Minipiep analysis of several transformed colonies (using Qiagen [Foster City, CA] plasmid 
mini kit) revealed the presence of an insert of the appropriate size within the expression 
vector. A 300 ml culture inoculated with one colony was grown, plasmid DNA extracted an^ 
subjected to sequence analysis. The entire 486 bp insert was shown to have the correct 
25 sequence for the mature Canf II cDNA with the addition of a reporter group encoded at the 5' 
end. This His6 reporter group was to be used in metal ion afiOnity purification of the 
recombinant protein using NTA Ni-H- chelating resin (Qiagen). A single colony of 
BL21(DE3) pET lid AHR His 6Can f II bacteria was inoculated into a 2 ml brain heart 
infusion (BHI) culture (-f200ng/ml ampicillin) and incubated at 37** C until turbid but not 
30 saturated. At this point 6 ^il was removed and added to 600 ^ll of BHI and mixed. 100 \il 
was spread onto each of 6 BHI agar plates (+200 \ig ampiciUin) and incubated overnight at 
37** C. The next morning the bacterial lawn was scraped oflF of the the plates, pooled and 
resuspended in 20 ml of BHI media, and then aliquoted 1 ml each into each of 1 8 500 ml 
BHI cultures (+200 jig/ml ampicillin) in 2 liter Ehrlenmeyer flasks. Cultures were incubated 
35 at 37*^0 and shaken at 300 rpm until the A600 reached 1.0. Isopropyl-fl-D- 

thiogalactopyranoside (IPTG) was than added to 1 mM to induce expression of the T7 RNA 
polymerase gene which would in turn induce expression of His6C2!Lf II protein from the 
hybrid T7 gnlO/lac O promoter. 
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Expression was allowed to proceed for 2 hours after which the bacteria were pelleted 
and resuspended in 6 M guanidine hydrochloride (GuHCl), 1 00 mM NaPCM, 1 0 mM Tris, 
ICQ mM 2-niercaptoethanol pH 8.0. E^ctraction was carried out for 1 hour with vigorous 
shaking and terminated by pelleting of the insoluble material at 10 K ipm in a JA-1 0 rotor 
5 (Beckman) for 1 hour. Supernatant was removed, and its pH adjusted to 8.0 before loading 
onto a 50 ml NTA agarose colunui that had been equilibmted in 6 M GuHCl, 1 00 mM 
NaP04, 10 mM Tris, pH 8.0. The column was washed by step gradient as follows: 1) 6 M 
GuHCl, 100 mM NaP04, 10 mM Tris, pH 8.0, 2) 8 M urea, 100 mM NaP04, 10 mM Tris, 
pH 8.0, 3) 8 M urea, 100 mM NaOAc, 10 mM Tris, pH 6.3 with each wash proceeding imtil 

10 the A280 of the effluent Gcom the column reached background. Recombinant His6£mLf II 
protein was eluted from the column with 8 M urea, 1 00 mM NaOAc, 1 0 mM Tris, pH 4.5. 
Yield of the pooled peak fractions was '-100 mg with a purity of ^80% as determined by 
densitometry of a sample of the material analyzed by SDS-PAGE. . ; 

E. coil transformed with the vector pETl ld,cont£uning the nucleic acid encoding Can 

15 f II have been deposited with the ATCC at accession nuniber XXXX. 

F.XAMPLK 9t Direct Binding of Human IpE to Native and Recombinant Can f II 

Plasma samples from l4 dog-allergic patients (skin test 4+) were assayed for IgE 
binding to native Can f II and rCanf II. An ELISA plate (Inunulon II Dynatech, Chantilly, 

20 VA) was coated with native and bacterially e}q>ressed recombinant £aQ-f II at O.S ^g/well in 
PBS-Tween and incubated overnight at 4° C. The coating antigen was removed and the wells 
were blocked with 0.5% gelatin in PBS, 200 nl/well for two hours at room temperatre. 
Binding of human IgE to the coating antigen was detected using biotinylated goat-anti- 
himian IgE, streptavidin linked to peroxidase and TMB substrate. Reactions were read on a 

25 plate reader at 450 nm (A450). Of 14 plasma samples tested for IgE to native Caof H, 5 
contained detectable antibody binding (Figure 22A), with plasma from patients #901 and 
#227 containg the highest levels. Similarly, of 23 plasma samples tested for IgE to 
recombinant, . bacterially expressed CanX II, several contained detectable antibody binding 
(Figures 22B and C). 

30 

EXAMPI.F. 10: Can f I Human T Cell Proliferation Analysis 

To identify peptides having CanJfl T cell stimulating activity several peptides derived 
from Canf I were produced and cultured Avith human T cell lines primed with recombinant 
Can f I protein and the responses were determined by standard T cell proliferation assays. A 
3 5 set of peptides derived from Canf I (Construct 1 (SEQ ID NO: 1 05), Constract 2 (SEQ ID 
NO: 106), and Construct 3 (SEQ ID NO: 107)), each representing a portion of the Caaf I 
protein were used in proliferation assays. In addition, the assays included two peptides 
(A0095 (SEQ ID NO: 108), amino acids 7 through 19; and A0b96 (SEIQ ID NO: 109); amino 
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acids 42 through 54) which were selected as containing' potential T cell epitopes using the 
algorithm described in Hill et al.. Journal of Immmoloey 147: 184-197. 

Constructs 1, 2 and 3 were produced by expressinjg and purifying portions of the Canf 
I protein in £. co// using standard techniques. DNA fragments encoding Construct 1 (amino 
5 acids 1 through 65 of Canf 1) and Construct 2 (amino acids 56 through 108) were obtained 
by amplification from a plasmid pETl IdA HRHis6£aaf lA containing 
full length Can f l cDNA and were subcloned into an EcoRI/BamHI site of pETl Id vector 
containing a His6 reporter sequence. A DNA fragment encoding Construct 3 (amino acids 90 
through 148) was amplified from the same plasmid and subcloned into an EcoRI site of 

10 pETl 1 d vector. Recombinant proteins were affinity purified on a NTA Ni"*^ chelating resin 
(Qiagen) according to published protocol. 

In order to perforai in vitro assays of human T cell proliferative response to CaiLf I 
and the five Canf I peptides described above, whole blood obtained fipom subjects allergic to 
Can f I in a skin prick test was passed through a Lymphocyte Separation Media (LSM) to 

15 remove platelets, red blood cells and granulocytes. The resulting peripheral blood 

mononuclear cells (PBMC) were stimulated with 50^g/ml of recombinant £aaf I produced 
as described in Example 2 for 6 days in RPMI 1640 medium supplemented with 5% heat 
inactivated human AB serum, 2mM L-glutamine,10mM HEPES, 50pM 2-mercapto-ethanol 
and 1 00 U/ml penicillin and streptomycin. 

20 A second separation LSM was performed to remove high density cell debris and dead 

cells. The resulting PBMC cells were allowed to proliferate for 12-18 days. During this time 
the medium was supplemented with additions of recombinant IL2 (Sunits/ml) and 
IL4 (5units/ml.). 

When the lymphocytes reached a point of rest (determined such that an overnight 
25 pulse of 20,000 cells with thymidine was vvithin the 2000 - 4000 CPM range), the cells 
were restimulated for analysis in secondary proliferation assays. Secondary T cell 
proliferation assays included 2xl04 T cells /well, 5x1 0* PBMC/ well (irradiated with 3500 
rads) as antigen presenting cells. Antigens were assayed in duplicate or triplicate wells at the 
following concentrations: 
30 rCanLfI:4,20andl00fxg/ml. 

Peptides ! 3, 15, and 75 |ig/ml. 

Dog extract : 3, 15 and 75 fig of protein/ml. The concentration of Canjfl in this 
preparation dog extract is unknown. 

Constructs : initially at a single concentration of 20^g/ml, then at 3, 1 5, and75 jig/ml. 
35 Constructs were insoluble at 75|ag/ml. 

PHA was added at l)ig/ml to indicate nonspecific activation ability. Tetanus toxoid, 
an irrelevant antigen, was added at dilutions of 1:2000, 1 :4000 and 1:8000 to indicate T cell 
specificity. 
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After 3 days of culture under conditions of secondary assay l\xCi of thymidine 
was added to each culture well for overnight incubation. Cultures were harvested on glass 
fiber filters and thymidine incorporation was measured by P scintillation counting. 
Stimulation indices which measure the strength of a T cell response to a peptide were 
5 calculated by dividing thymidine uptake of treated cultures by thymidine uptake of 
untreated medium controls. 

The results of the secondary T cell proliferation assays are shown in Figures 24-25. 
Figure 24 graphically compares the stimulation indices of individual subjects to rCanf I, 
peptides A0095-A0096 and Constructs 1-3. This comparison indicates that significant areas 

10 of T cell reactivity in the Can f I protein arc found at all three parts of the protein, as shown 
by the substantial stimulation indices of Constructs 1-3 which, together, encompass the entire 
Can f I protein sequence. 

Positivity indices for the peptides shown in Figure 10 were calculated by multiplying 
the mean T cell stimulation index (Figure 25) by the percent of the tested individuals who 

1 5 had a positive response or a T cell stimulation index of at least two. The percentage of 
positive responders for each tested peptide were as follows: r£aaXI: 89%, A0095: 43%, 
A0096: 43%, Construct 1 : 64%, Construct 2: 73%, Construct 3: 82%. Comparison of 
positivity indices (Figure 10) which measures both the strength of a T cell response to a 
peptide (S.I.) and the frequency of a T cell response to a peptide in a population of dog 

20 dander allergen sensitive individuals indicates that both N terminal (amino acids 1 through 

65) and C terminal (amino acids 90 through ,1 08) ends of the CaaJf I protein contain a number 
of T cell epitopes. 

FOinVALENTS 

25 Although the invention has been described with reference to its preferred 

embodiments, other embodiments can achieve the same results. Those skilled in the art will 
, recognize or be able to ascertain using no more than routine experimentation, numerous 
, equivalents to the specific embodiments described herein. Such equivalents are considered to 
be within the scope of this invention and are encompassed by the following claims. 
30 ■ . . - • '"'^ •- ■-^ 
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SEQUBNCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: * . . 

(A) NAME: IMMULOGIC PHTIRMACEOTICAL CORPORATION 

(B) STREET: 610 Lincoln street 

(C) CITY: Waltham 
10 (D) STATE: MA 

(E) COtTNTRY: USA 

(F) POSTAL CODE (ZIP) : 02154 

(G) TELEPHONE: (617) 466-6000 

(H) TELEFAX: (617)466-6040 



15 



(ii) TITLE OF INVENTION: Allergenic Protein and Peptides from Dog 
^ Dander and Uses There £or 



NUMBER OF SEQUENCES: 109 

COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC coxnpatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: ASCII Text 

(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(vi) PRIOR APPLICATION DATA: ^ 

(A) APPLICATION NUMBER: USOB/156,549 

(B) FILING DATE: 22 -Nov- 93 

(A) APPLIC^ATION NUMBER: US07/999, 712 

(B) FILING ^DATE : 31-Dec-92 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) ^ NAME : Mandragouras , Amy E . 

(B) REGISTRATION. NUMBER: 36,207 

(C) REFERENCE/DOCKET NUMBER: IMI-026CPPC 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 227-7400 

(B) TELEFAX: (617) 227-5941 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 525' base' pairs 
' (B) TYPE: nucleic acid ' 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(iii) 
(iv) 



(ii) MOLECULE TYPE: cDNA 
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15 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1, .525 

5 (ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 79.. 525 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

ATG AAG ACC CTG CTC CTC ACC ATC GGC TTC AGC CTC ATT GCG ATC CTG 4 8 

Met Lys Thr Leu Leu Leu Thr lie Gly Phe Ser Leu lie Ala lie Leu 
-26 -25 -20 -15 

CAG GCC CAG GAT ACC CCA GCC TTG GGA AAG GAC ACT GTG GCT GTQ TCA 96 
Gin Ala Gin Asp Thr Pro Ala Leu Gly Lys Asp Thr Val Ala Val Ser 
-10 -5 1 5 

20 GGG AAA TGG TAT CTG AAG GCC ATG ACA GCA GAC CAG GAG GTG CCT GAG 144 
Gly Lys Trp Tyr Leu Lys Ala Met Thr Ala Asp Gin Glu Val Pro Glu 
10 15 20 

AAG CCT GAC TCA GTG ACT CCC ATG ATC CTC AAA GCC CAG AAG GGG GGC 192 
25 Lys Pro Asp Ser Val Thr Pro Met lie Leu Lys Ala Gin Lys Gly Gly 
25 30 35 

AAC CTG GAA GCC AAG ATC ACC ATG CTG ACA AAT GGT CAG TGC CAG AAC 24 0 

Asn Leu Glu Ala Lys lie Thr Met Leu Thr Asn Gly Gin Cys Gin Asn 
30 40 45 50 

ATC ACG GTG GTC CTG CAC AAA ACC TCT GAG CCT GGC AAA TAC ACG GCA 288 
lie Thr Val Val Leu His Lys Thr ser Glu Pro Gly Lys Tyr Thr Ala 
55 60 65 70 



35 



TAC GAG GGC CAG CGT GTC GTG TTC ATC CAG CCG, TCC CCG GTG AGG GAC 336 
Tyr Glu Gly Gin Arg Val Val Phe lie Gin Pro Ser Pro Val Arg Asp 
75 80 85 



40 CAC TAC ATT CTC TAC TGC GAG GGC GAG CTC CAT GGG AGG CAG ATC CGA 3 84 

His Tyr He Leu Tyr Cys Glu Gly Glu Leu His Gly Arg Gin He Arg 
90 95 100 

ATG GCC AAG CTT CTG GGA AGG GAT CCT GAG CAG AGC CAA GAG GCC TTG 432 
45 Met Ala Lys Leu Leu Gly Arg Asp Pro Glu Gin Ser Gin Glu Ala Leu 
105 110 ' 115 

GAG GAT TTT CGG GAA TTC TCA AGA GCC AAA GGA TTG AAC CAG GAG ATT 480 
Glu Asp Phe Arg Glu Phe Ser Arg Ala Lys Gly Leu Asn Gin Glu He 
50 120 125 130 

TTG GAA CTC GCG CAG AGC GAA ACC TGC TCT CCA GGA GGA CAG TAG 525 
Leu Glu Leu Ala Gin Ser Glu Thr Cys Ser Pro Gly Gly Gin 
135 140 145 

55 
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(2) INFORMATION FOR SEQ ID NO: 2: ,, ,, ^ . . . ^ 

(i) SEQXXENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOI*OGY: linear 

(ii) MOLECULE TYPE: protein 

10 ■ (xi) SEQUENCE DESCRIPTION: SEQ . ID ^NO: 2: 

Met Lys Thr Leu Leu Leu Thr lie Gly Phe.Ser Leu He Ala He Leu 
-26 -25 -20 -15 

15 Gin Ala Gin Asp Thr Pro Ala Leu Gly Lys Asp Thr Val Ala Val Ser 
-10 -5 1 5 

Gly Lys Trp Tyr Leu Lys Ala Met Thr Ala Asp Gin Glu Val Pro Glu 
10 15 20 

20 

Lys Pro Asp Ser Val Thr Pro Met He Leu Lys Ala Gin Lys Gly Gly 
25 30 35 

Asn Ijeu Glu Ala Lys He Thr Met Leu Thr Asn Gly Gin Cys Gin Asn 
25 40 45 .50 

He Thr Val Val Leu His Lys Thr Ser Glu. Pro Gly. Lys^ Tyr Thr Ala 
55 60 . 65 . 70 

- f- i V . ' . . 

30 Tyr Glu Gly Gin Arg Val Val Phe He Gin Pro Ser Pro Val Arg Asp 

75 80 .85 



35 



His Tyr He Leu Tyr Cys Glu Gly Glu Leu His Gly Arg Gin He Arg 
90 BS ^ 100 

Met Ala Lys Leu Leu Gly Arg Asp Pro Glu , Gin i Ser Gin Glu Ala Leu 
105 110 115 



Glu Asp Phe Arg Glu Phe Ser Arg Ala Lys . Gly Leu Asn Gin Glu He 
40 120 125 130 



45 



Leu Glu Leu Ala Gin Ser Glu Thr Cys Ser Pro Gly Gly Gin 
135 140 , 145,. 



(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 
(A} LENGTH: 7 amino acids 
50 (B) TYPE: amino acid 

(D) TOPOLOGY: linear i 

(ii) MOLECULE TYPE: peptide 

55 (v) FRAGMENT TYPE: internal 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: ' *' 

Trp Tyr Leu Lys Ala Met Thr 

1 5 - . : * 

(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single ' 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
20 GGGAATTCTG GTAYTTRGCN ATGAC 25 

(2) INFORMATION FOR SEQ ID NO:S: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE:' nucleic acid- * - 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 



30 



35 



40 



45 



50 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GGGAATTCTG GTAYCTNAAR GCNATGAC 28 

(2) INFORMATION FOR SEQ ID NO: 6:' 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: f8'-'amirio acids • • ' 

(B) TYPE: amino ^^acid ^ 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide - ^ 

(v) FRAGMENT TYPE: internal ^ • ; > 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Asp Gin Glu Val Pro Glu Lys Pro 
15 
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10 



15 



25 



30 



45 



50 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQtTENCE CHARACTERISTICS: 

(A) LENGTH: 23 base ^pairs - . . ^ . 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single ' . ^ . 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: CDNA 

. - ^ y \ , . ^ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 _ 

■ * ^ ^ > If ^ . '. 

GAYCARQARQ TNCCDGARAA RCC 23 

■ - , f t .:: . . 

:- u ' ;■ 

(2) INFORMATION FOR SEQ ID NO:B: 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Tyr lie Leu Tyr Cys Glu Gly 
15 



35 (2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQtJENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ,ID NO:9: • 
TACATTCTNT AYTGTGARGG 20 

(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 amino acids 
55 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
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5 



40 



(v) FRAGMENT TYPE: intiemal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 



Met He Leu Lys Ala Gin Lys Gly 
1 5 



10 (2) INFORMATION FOR SEQ ID NO:ll: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 29 base pairs 
<B) TYPE: nucleic acid 
IS (C) STRANDEDNESS : single 

<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

20 (xi) SEQtJENCE DESCRIPTION: SEQ ID NOrll: 

GGGGATCCYT TYTGNGCYTT YAADATCAT 29 

(2) INFORMATION FOR SEQ ID NO: 12: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iv) ANTISENSE: yes 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GGGGATCCYT TYTGNGCYTT YAGDATCAT 29 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid • 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear - . 
50 (ii) MOLECULE TYPE: cDNA 

(iv) ANTISENSE: yes 

..'t': . . ... ... ^ • • , , 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: n 

GGGGATCCYT CACARTAYAA DATRTA 26 
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10 



35 



45 



50 



(2) INFORMATION FOR SEQ ID NO:14: Cni -rfrvii f ... ^„ . 

(1} SEQUENCE CHARACTERISTICS: 

(A) I*ENGTH: 26 base pairs y . ^ * 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single c< ^ v 

(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: CDNA 
(iv) ANTISENSE: yes 
(Xi) SEQUENCE DESCRIPTION: SEQilD.NO:14r : 
15 GGGGATCCYT CACARTANAG DATRTA ^ ; . 26 

(2) INFORMATION FOR SEQ ID NO:X5: :k c > 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino r acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear - . 

25 (ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Gly Gin Arg Val Val Phe lie 

1 5 - - ^ . ' 



(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA * ' - 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

' ' ' f * • . ■ 

GGGCTCGAGG CCAGCGTGTC GTGTTCATC 29 
(2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids 
55 (B) TYPE: amino acid' ^ 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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5 



20 



25 



45 



<v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID,.N0:17: 

Gin Pro Ser Pro Val Arg Asp 
1 5 



10 (2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) IiENGTH: 30 base pairs. 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: cDNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GGGCTCGAGC AGCCGTCCCC GGTGAGGGAC * , 30 

(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids 
30 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

35 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

40 Gin Glu Leu Ala Glu Asp Phe 
1 5 



(2) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single » 
50 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

CARGARGCNC TDGARGAYTT 2C 
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<2) INFORMATION FOR SEQ ID NO: 21: 



10 



(i) SEQX7ENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 
(iv) ANTISENSE: yes 



15 



(xi) SEQUENCE DESCRIPTION: - SEQ ID NO: 21:. 
GGGGAGATCT CGAGAGGAAG CTGCGGCCGC TGCA 



34 



20 



25 



30 



35 



40 



45 



(2) INFORMATION FOR SEQ ID NO: 22: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single r j 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 
(iv) ANTISENSE: yes 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
CGAATACGAC TCACTATAGG AAGCTGCGGC CGCTGCAGTA CTTTTTTTTT 'riTTTri'T 

(2) INFORMATION FOR SEQ ID NO: 23: ^ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iv) ANTISENSE: yes 



SB 



50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 

CGAATACGAC TCACTATAGG 



20 
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10 



15 



20 



45 



50 



(2) INFORMATION FOR SEQ ID NO:24: ' ' 

(i) SEQUENCE CHARACTERISTICS: 1 

(A) IiENGTH: 26 base pairs " 

(B) TYPE: nucleic acid ^ 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iv) ANTISENSE: yes 

(xi) SEQUENCE DESCRIPTION:* SEQ IDN0:24: ' 
CCTAGGAGGA AGCTGCGGCC GCTGCA 26 

(2) INFORMATION FOR SEQ ID NO:25: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

30 (xi) SEQXJENCE DESCRIPTION: SEQ ID NO:25: 

GGGTCTAGAG 6TACCGTCCG 20 

35 (2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
GGGTCTAGAG GTACCGTCCG ATCGATCATT 30 

(2) INFORMATION FOR SEQ ID NO:27: 



Ci) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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10 



30 



40 



45 



50 



55 
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(ii) MOLECULE TYPE: CDNA 

(iv) ANTISENSE: yes ' " * * . ' ' 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
AATGATCGAT GCT - i' , : . 13 

(2) INFORMATION FOR SEQ ID N0:2a: ^ i 



ix) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 
15 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

20 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

25 Lys Trp Tyr Leu Lys Ala Met Thr Ala Asp 

1 5 10 * ' 



(2) INFORMATION FOR SEQ ID NO:29: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
35 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
AARTGGTAYC TNAARGCNAT GACAGCAGAC 30 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 7 amino acids ^ 

(B) TYPE; amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Gin Glu Val Pro Glu Lys Pro 
1 5 
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15 



20 



40 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQtJENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iv) ANTISENSE: yeS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GGGGATCCAG GCTTCTCAGG CACCTCCTG 29 

(2) INFORMATION FOR SEQ ID NO: 32: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
25 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

30 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

35 Asp Ser Val Thr Pro Met 
1 5 



(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: -nucleic acid 

(C) STRANDEDNESS: single 
45 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(iv) ANTISENSE: yes 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
GGGGATCCAT GGGAGTCACT GAGTC 25 
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(2) INFORMATION FOR SEQ ID NO: 34: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid - / 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

10 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ^ ID .NO:34 : ... 



15 Met Lys Thr Leu Leu Leu Thr He 
1 5 



(2) INFORMATION FOR SEQ ID NO:35: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

CCTCGAGATG AAGACCCTGC TCCTCACCAT C 31 



35 (2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
45 (iv) ANTISENSE: yes 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
50 GGGAGATCTC AGAGGCSTCAT GGAGCTGCTG CCC 33 



(2) INFORMATION FOR SEQ ID ' NOV37 : ' 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(i) 
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(ii) MOUBCOIiE TYPE: cDNA 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37; 

CCCTCGAGGA CACTGTGGCT GTGTCAGGGA AATGG 35 

10 (2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 18 base pairs 
(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



20 



25 



35 



40 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3B: 
CCTGAGAAGC CTGACTCA t ^® 

(2) INFORMATION FOR SEQ ID NO:39: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
30 (B) TYPE: nucleic. acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 
ACTGCATACG AGGGCCAGCG T 21 

(2) INFORMATION FOR SEQ ID NO: 40: 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LBINGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

50 (ii) MOLECULE TYPE: cDNA 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
GGGAATTCCA GCCGTCCCCG GTGA6C3GAC , 



29 



wo 94/16068 



PCTAJS93/12468 



.61- 



10 



15 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOIX3GY : linear 

(ii) MOLECUI-E TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

"11 



ATCACCATGC TGACAAATGG T ' 21 

(2) INFORMATION FOR SEQ ID NO:42: ' " ' 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 21 base pairs ^ t . i 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: cDNA 

(iv) ANTISENSE: yes 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

AATCTCCTGG TTCAATCCTT T . . 

35 (2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs r . 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear. 

(ii) MOLECULE TYPE: cDNA 
45 (iv) ANTISENSE: yes 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 
50 ACGCTGGCCC TCGTATGCAG T 21 



(2) INFORMATION FOR SEQ ID NO: 44: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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40 



55 



(ii) MOLECULE TYPE: cDNA 
(iv) ANTISENSE: yes 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44:. 
GGGGATCCAT GGGAC3TCACT GAGTC 25 

(2) INFORMATION FOR SEQ ID NO:45: 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) IiENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS : single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: cDNA 

(iv) ANTISENSE: yes 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 

GTGCAGGACC ACCGTGATGT T 21 

30 (2) INFORMATION FOR SEQ ID NO:46: . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal ' . 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46; 



Glu Phe Asp Thr Val Ala Val Ser Qly Lys 
45 1 5 10 



(2) INFORMATION FOR SEQ ID NO: 47: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA ' * r.'.Xr ; i : n / . : ' 



wo 94/16068 PCTaJS93/12468 

-63- 

(xi) SEQUENCE DESCRIPTION: SEQ ID N9:4T: 
GGAATTCGAC ACTGTGGCTG TGTCAGGGAA A - ■ • i - 31 



(2) INFORMATION FOR SEQ ID NO: 48: . 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids 
10 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

IS (v) FRAGMENT TYPE: internal 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

■ ■ ■■ ' ■ ■ 

20 Glu Phe ser Arg Ala Lys Gly 
1 5 



(2) INFORMATION FOR SEQ ID NO: 49: 



or. ^ 



• .1 



^ * } 



(i) SEQUENCE CHARACTERISTICS: - M . . . , r 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid , ... 

(C) STRANDEDNESS : single 
30 (D) TOPOLOGY: linear 

. '* ' ' * ' . ' ■■ ' • . . 

(ii) MOLECULE TYPE: CDNA 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 

GAGTTCTCAA GAGCCAAAGG AT 22 

40 (2) INFORMATION FOR SEQ ID NO : 50: , 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid . 
45 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
50 (iv) ANTISENSE: yes 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50: 
ATCCTTTGGC TATTGAGiAAC TC 



22 
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(2) INFORMATION FOR SEO ID NO:51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : single 

(D) TOPOLOGY: linear 

(ii) MOLBCDLB TYPE: cDNA 
(iv) ANTISEMSE: yes 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
GGGAGATCTA CTGTCCTCCT GGAGAGCAGG T 31 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Thr Cys Ser Pro Gly Gly Gin 
1 5 

(2) INFORMATION FOR SEQ ID NO: 53: 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide ■ ' 

(v) FRAGMENT TYPE: internal • ■ . 

(xi) SEQtJENCE DESCRIPTION: SEQ ID NO: 53: 

Met Gly His His His His His His Glu Phe Asp Thr Val Ala Val 
15 10 15 
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(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single ^ 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:54: 
CCCCTCGAGC CACATGAAGA CCCTGCTCCT CACCATC * * ' ^ 37 

(2) INFORMATION FOR SEQ ID NO:55: ' - 0 



(i) SEQUENCE CHARACTERISTICS: - ' ' - * '-^ ^ ' ^ 
20 (A) LENGTH: 27 amino acids J 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
(v) FRACT1ENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 



Glu Phe Ser Arg Ala Asri'Lys' Giy Leu Asn Gin Glu lie' Leu Glu Leu 
1 5 ' 10 15 

Ala Gin Ser Glu Thr Cys Ser Pro Gly Gly Gin 
35 2D 25 



(2) INFORMATION FOR SEQ ID NO: 56: 

40 (i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: .SEQ ID NO: 56: 
GAATTCTCAA GA6CCAAAGG ATTGAACCAG GAGATTTTGG AACTCGCGCA GAGCGAAACC 60 



55 



TGCTCTCCAG GAGGACAG 



78 
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<2) INFORMATION PGR SEQ ID NO: 57: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single . " . 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
GGGAATTCTC AAGAGCCAAA AGGATT. 26 

(2) INFORMATION FOR SEQ ID N0:5B: 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: interxial 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5e: „ 

Glu Thr Cys Ser Pro Gly Gly Gin His His His His Hie His 
1 5 10 



35 (2) INFORMATION FOR SEQ ID N0:S9: " 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear ' 

(ii) MOLECULE TYPE: cDNA 
45 (iv) ANTISENSE: yes ' ' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
50 TTTAGATCTA GTGGTGGTGG TGGTGGTGCT GTCCTCCTGG AGAGCAGGTT TC 52 

(2) INFORMATION FOR SEQ ID NO: 60: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 

(iv) ANTISENSE: yes - • 

5 . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
TTTAG^TCTA GTGGTGGTGG TGG 23 



10 



(2) INFORMATION FOR SEQ ID NO:61: 



(1) SEQUENCE CHARACTERISTICS: 
1 S (A) IiENGTH : 14 5 amino acids 

(B) TYPE: amino acid' 
(D) TOPOLOGY: linear 



20 



25 



(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 

(xi) SEQXreNCE DESCRIPTION: SEQ ID NO: 61: . 

Asp Thr Val Ala Val Ser Gly Lys Trp Tyr Leu Lys Ala Met Thr Ala 

1 5 " ^ 10 15 



Asp Gin Glu Val Pro Glu Lys Pro Asp Ser Val Thr Pro Met He Leu 
30 20 25 30 

Lys Ala Gin Lys Gly Gly Asn Leu Glu Ala Lys He Thr Met Leu Thr 
35 40 45 

35 Asn Gly Gin Cys Gin Asn He Thr Val Val Leu His Lys Thr Ser Glu 
50 55 60 

Pro Gly Lys Tyr Thr Ala Tyr Glu Gly Gin Arg Val Val Phe He Gin 
65 70 75 ' 80 

40 

Pro Ser Pro Val Arg Asp Arg Tyr He Leu Tyr Cys Glu Gly Asp Leu 
65 90 95 

Leu Pro Gin Ala His Leu Leu His Pro Ser Cys His His His Ser Leu 
45 100 105 110 

Leu Gin Ala Hie His Arg Leu Leu Leu Pro His Lys Lys Leu Leu Gin 
115 120 125 

50 Gly Asp Pro Cys Val Ala Gin Trp Phe Ser Ala Cys Leu Gly Leu Arg 
130 135 140 

Ala 
145 

55 
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(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) IiBNGTH: 144 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOIiOGY: linear 

(ii) MOLECULE TYPE: peptide 

10 (v) FRAGMENT TYPE: internal 

(xi) SEQXJENCE DESCRIPTION: SEQ ID NO: 62: 

15 Asp Thr val Ala Val Ser Gly Lys Trp Tyr Leu Lys Ala Met Thr Ala 
1 5 10 ^5 



20 



35 



ASD Gin Glu Val Pro Glu Lys Pro Asp Ser Val Thr Pro Met He Leu 
20 25 . 30 

Lys Ala Gin Lys Gly Gly Asn Leu Glu Ala Lys He Thr Met Leu Thr 
35 40 45 



Asn Gly Gin Cys Gin Asn He Thr Val Val Leu His Lys Thr Ser Glu 
25 50 55 60 

Pro Gly Lys Tyr Thr Ala Tyr Glu Gly Gin Arg Val Val Phe He Gin 
65 70 75 80 

30 Pro ser Pro Val Arg Asp His Tyr He Leu Tyr Cys Glu Gly Glu Leu 

85 90 95 



His Gly Arg Gin He Arg Met Ala Lys Leu Leu Gly Arg Asp Pro Glu 
100 105 110 

Gin Ala His His Arg Leu Leu Leu Pro His Lys Lys Leu Leu Gin Gly 
115 120 125 



Asp Pro Cys val Ala Gin Trp Phe Ser Ala Cys Leu Gly Leu Arg Ala 
40 130 135 140 

(2) INFORMATION FOR SEQ ID NO: 63: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid , 
(D) TOPOLOGY: linear 

50 (ii) MOLECULE TYPE: . peptide 

(v) FRAGMENT TYPEs internal 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Pro Glu Lys Pro Asp Ser Val Thr Pro Met He Leu Lys Ala Gin Lys 
1 5 10 15 
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Gly Gly Asn Leu Glu Ala Lys lie Thr Met :I#euQThr.:JAfinoGly6Gln^i?ySi^^^^ (^ j. 
20 25 30 

Gin Asn lie Thr Val Val Leu His Lys Thr Ser Glu Pro Gly Lys Tyr 
5 35 40 45 ~ 

Thr Ala Tyr Glu Gly Gin Arg Val Val Phe lie Gin Pro Ser Pro Val 
50 55 60 

10 Arg Asp His Tyr lie Leu Tyr Cys Glu Gly Glu Leu His Gly Arg Gin 
65 70 75 80 



15 



30 



35 



45 



55 



lie Arg Met Ala Lys Gly Leu Asn oln Glu. He Leu Glu Leu Ala Gin 
85 90 95 

Ser Glu Thr Cys Ser Pro Gly Gly Gin 
100 105 



20 (2) INFORMATION FOR SEQ ID NO: 64: 

• ■ ; y 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 11 amino acids 

(B) TYPE: amino acid 
25 (D) TOPOLOGY: linear 

(ii) MOLECUl^ TYPE: peptide 

(v) FRAGMENT TYPE: internal 

<xi) SEQX3ENCE DESCRIPTION: SEQ ID NO: 64 1 



Met Ala Lys Leu Leu Gly Arg Asp Pro Glu Gin 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS!: 
40 (A) LENGTH: 38 amino acids 

(B) TYPE I amino' acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 65: 

50 Met Ala Lys Leu Leu Gly Arg Asp Pro Glu Gin Ser Gin Glu Ala Leu 

1 S ' . ■ ; 10 15 

Glu Asp Phe Glu Phe Ser Ala Lys Gly I^u Asn Gin Glu He Leu Glu 
20 25 30 



Leu Ala Gin Ser Glu Thr 
35 
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(2) INFORMATION FOR SEQ ID NO: 66: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids , ; 
5 (B) TYPE: -amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECDLE TYPE: peptide 
10 (v) FRAGMENT TYPE: internal - , 

(xi) SEQXJENCE DESCRIPTION: SEQ , ID NO: 66: 

15 Asp Pro Glu Gin Ser Glu Glu Ala 
1 5 



20 



40 



{2} INFORMATION FOR SEQ ID NO: 67 J 



{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 79X base pairs.' 
<B) TYPE; nucleic acid i. 
(C) STRANDEDNESS : single 
25 (D) TOPOLOGY: linear 

(ii) MOIiECULE TYPE: cDNA 



30 (ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 195.. 734 

(ix) FEATURE: 

35 (A) NAME/KEY: mat_peptide 

(B) LOCATION: 253.. 734 



(xi) SEQtffiNCE DESCRIPTION: SEQ ID NO: 67: 

AGAGCTGGAC CCGTGTGTGT GCTGGCCAAT GAGCCCTGGA GGGTCCGGCT CCAGAGTACC 60 

CTCTTGGCAC AGGGCCGAGT CCATCGGGAC AGATGAACCT AGAGGACTCC ACTGCCCTCC 120 

45 CATCCACGGG GCCGGGTCAC CAGACTCTGC AAGTCTCCAG CTGTCGCCAA ACCCAGACAG 180 

AAGGTGCTGT GGAC ATG CAG CTC CTA CTG CTG ACC GTG GGC CTG GCA CTG 230 
Met Gin Leu Leu Leu Leu Thr Val Gly Leu Ala Leu 
-19 -15 -10 

50 

ATC TGT GGC CTC CAG GCT CAG GAG GGA AAC CAT GAG GAG CCC CAG GGA 278 
He Cys Gly Leu Gin Ala Gin Glu Gly Asn His Glu Glu Pro Gin Gly 

-5 , 1 , . 5 

55 GGC CTA GAG GAG CTG TCT GGG AGG TGG CAC TCC 6TT GCC CTG GCC TCC 326 
Gly Leu Glu Glu Leu Ser Gly Arg Trp His Ser Val Ala Leu Ala Ser 
10 15 20 ' 25 
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AAC AAG TCC GAT CTG >ATC AAA CCC vTGG^.GGGrCACj.TTC.AJTC^^Gra TTC ATC 374 
Asn Lys Ser Asp I*eu lie Lys Pro Trp Gly His Phe Arg Val Phe lie 
30 35 40 

5 . • . . . 

CAC AGC ATG AGC GCA AAG GAG GGC AAC CTG CAC GGG GAT ATC CTT ATA 422 
His Ser Met Ser Ala Lys Asp Gly Asn Leu His Gly Asp lie Leu lie 

45 > 50 , 55 

10 CC6 CAG GAC GGC CAG TGC GAG AAA GTC TCC CTC ACT GCG TTC AAG ACT 470 
Pro Gin Asp Gly Gin Cys Glu Lys Val Ser Leu Thr Ala Phe Lys Thr 
60 65 70 

GCC ACC AGC AAC AAA TTT GAC CTG GAG^TAC TGG GGA CAC AAT GAC CTG 518 
15 Ala Thr Ser Asn Lys Phe Asp Leu Glu Tyr Trp Gly His Asn Asp Leu 
75 eo 85 

TAG CTG GCA GAG GTA GAC CCC AAG AGC TAC' CTG ATT CTC TAC ATG ATC 566 
Tyr Leu Ala Glu Val Asp Pro Lys Ser Tyr Leu lie Leu Tyr Met lie 
20 90 55 . 100 105 

AAC CAG TAC AAC GAT GAC ACC AGC CTG GTG GCT CAC TTG ATG GTC CGG 614 
Asn Gin Tyr Asn Asp Asp Thr Ser Leu Val Ala His Leu Met Val Arg 
110 115 120 



25 



40 



GAC CTC AGC AGG CAG CAG GAC TTC CTG CCG GCA TTC GAA TCT GTA TGT 662 
Asp Leu Ser Arg Gin Gin Asp Phe Leu Pro Ala Phe Glu Ser Val Cys 
125 130 13 5 



30 GAA GAC ATC GGT CTG CAC AAG GAC CAG ATT GTG GTT CTG AGC GAT GAC 710 
Glu Asp lie Gly Leu His Lys Asp Gin lie Val Val Leu Ser Asp Asp 
140 145 150 

GAT CGC TGC CAG GGT TCC AGA GAC TAGGGCCTCA GCCACGCAGA GAGCCAAGCA 764 
35 Asp Arg Cys Gin Gly Ser Arg Asp . > , 

155 160 

GCAGGATCTC ACCTGCCTGA GTACGGT 791 



(2) INFORMATION FOR SEQ ID NO: 68: 



( i ) SEC3UENCE CHARACTERISTICS : 

(A) LENGTH: 180 amino acids 
45 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

50 <xi) SEQUENCE DESCRIPTION: SEQ ID NO 168: 

Met Gin Leu Leu Leu Leu Thr Val Gly Leu Ala Leu He Cys Gly Leu 
-19 -15 -10 -5 

55 Gin Ala Gin Glu Gly Asn His Glu Glu Pro Gin Gly Gly Leu Glu Glu 

'1 . ■ • 5 * ^ ^ 10 



Leu Ser Gly Arg Trp His Ser Val Ala Leu Ala Ser Asn Lys Ser Asp 
15 20 25 
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Leu He Lys Pro Trp' Gly His Phe Arg Val Phe He His Ser Met Ser 
30 35 40 . 45 

5 Ala Lys Asp Gly Asn Leu His Gly Asp He Leu He Pro Gin Asp Gly 

' 50 55^ - ; ' 60 . . 



10 



25 



30 



50 



Gin Cys Glu Lys Val Ser. Leu Thr Ala Phe Lys Thr Ala Thr Ser Asn 
65 70 75 

Lys Phe Asp Leu'Glu Tyr Trp Qly His 'Asn Asp Leu Tyr Leu Ala Glu 
BO B5 90 



Val Asp Pro Lys Ser Tyr Leu He Leu Tyr Met He Asn Gin Tyr Asn 
15 95 'lOO " ^ 105 - 

Asp Asp Thr Ser Leu Val Ala His Leu Met Val Arg Asp Leu Ser Arg 
110 115 ' 120^ : 125 

20 Gin Gin Asp Phe Leu Pro Ala Phe Glu Ser Val Cys Glu Asp He Gly 

130 135 140 

Leu His Lys Asp Gin He Val! Val Leu Ser Asp Asp Asp Arg Cys Gin 
145 - 150 155 



Gly Ser Arg Asp 
160 

(2) INFORMATION FOR SEQ ID NOi69: 



( i ) SEQUENCE CHARACTERI STICS : v; 

(A) LENGTH: 793 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNBSS : single 

35 (D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 



40 {ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 3. ,533 



45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 



AC AGC ACC TTC TGT CTG GGT TTG GCA CTG ATC TGT GGC CTC CAG GCT 47 
ser Thr Phe Cys Leu Gly Leu Ala Leu He Cys Gly Leu Gin Ala 
1 5 10 15 

CAG GAG GGA AAC CAT GAG GAG CCC CAG GGA GGC CTA GAG GAG CTG TCT 95 
Gin Glu Gly Asn His Glu Glu Pro Gin Gly Gly Leu Glu Glu JjBU Ser 
20 25 30 



55 GGG AGG TGG* CAC TCC GTT GCC CTG GCC TCC AAC AAG TCC GAT CTG ATC 143 
Gly Arg Trp His Ser Val Ala Leu Ala Ser Asn Lys Ser Asp Leu He 
35 40 45 
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AAA CCC TGG GGG OVCuTTCr AGG ,GTT.^TTC- ATC, jCAC, AGC, ATG AGC GCA AAG ^ X91 
Lys Pro Trp GXy His Phe Arg Val Phe lie His Ser Met Ser Ala Lys 
50 55 60 . 

/ " ^ , - ■ . -V . ■ ' 

5 GAG GGC AAC CTG CAC GGG GAT ATC CTT ATA CCG CAG GAC GGC CAG TGC 239 
Asp Gly Asn lieu His Gly Asp lie Leu lie Pro Gin Asp Gly Gin Cys 

65 i / }.70^ • * . - .^.^ , ^ 

GAG AAA GTC TCC CTC ACT GCG TTC AAG ACT GCC ACC AGC AAC AAA TTT 2B7 
10 Glu Lys Val Ser Leu Thr ; Ala , Phe . Lys Thr Ala Thr Ser Asn Lys Phe 
eo 85 90 95' 

GAC CTG GAG TAG TGG GGA CAC AAT GAC CTG TAG CTG GCA GAG GTA GAC 335 
Asp Leu Glu Tyt Trp Gly His Asn^ Asp Leu Tyr Leu Ala Glu Val Asp 
15 100 105 110 

CCC AAG AGC TAC CTG ATT CTC TAC ATG ATC AAC CAG TAG AAC GAT GAC 383 
Pro Lys Ser Tyr Leu lie Leu Tyr Met lie Asn Gin Tyr Asn Asp Asp 

- 115 V ; 120. : ' :125 

20 ' • 

ACC AGC CTG GTG GCT CAC CTG ATG GTC CGG GAC CTC AGC AGG CAG CAG 431 

Thr Ser Leu Val Ala His Leu Met Val Arg Asp Leu Ser Arg Gin Gin 
130 135 140 

25 GAC TTC CTG CCG GCA TTC GAA TCT GTA,TGTiGAA GAC ATC GGT CTG CAC 479 
Asp Phe Leu Pro Ala Phe Glu Ser Val Cys Glu Asp lie Gly Leu His 
145 150 155 

AAG GAC CAG ATT GTG GTT CTG AGC GAT GAC GAT CGC TGC CAG GGT TCC 527 
30 Lys Asp Gin He Val Val Leu Ser Asp Asp Asp Arg Cys Gin Gly Ser 
160 165 170 175 

AGA GAC TAGGGCCTCA GCCTACGCAG AGAGCCAAGC AGCAGGATCT CACCTGCCTG 583 
Arg Asp 

AGGACTCAQA CCTATAGGCT CGGGGGACAC CGTACTCAGC TCTGCGTCCC, TCTCTGCGAA 643 

CCCTCCAGGT GATCCCAGCA ACAACACCCA CCTGNGCTTC CATGTGCGGN CCTGTCCfGC 703 

CTGCGCCCAC TCCCTGCCTG GGCAGCCACA CACTCCCCAG CCCCCTGCTA TGGTCCCTCC 763 

TCGCATAATA AAGGACATTC CGTTCAAAAA '^^^ 

(2) INFORMATION FOR SEQ ID NO :70s 



35 



40 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) liENGTH: X77 amino acids 
50 (B) TYPE: amino acid 

JD) TOPOLOGY: linear 



(ii) MOLECULE' TYPE:' protein ^ ' - ' ' - ^ ' . 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Ser Thr Phe Cys Leu Gly' Leu^ Ala Leu Ile^ Cys' Gly. Leu Gin Ala Gin, 
1 ' 5 10 15 
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Qlu Gly Asn His Glu Glu Pro Gin Gly' Gly Leu Glu Glu Leu Ser Gly 

20 - '25 30 

Arg Trp His Ser Val Ala Leu Ala Ser Asn Lys Ser Asp Leu lie Lys 
5 35 '40 45 

Pro Trp Gly His Phe Arg Val Phe lie His Ser. Met Ser Ala Lys Asp 
50 . . 55 60 

10 Gly Asn Leu His Gly Asp lie Leu He Pro Gin Asp Gly Gin Cys Glu 
65 70 75 80 

Lys Val Ser' Leu Thr Ala Phe ' Lys Thr' Ala Thr Ser. Asn.. Lys Phe Asp 

' ■ *■ B5 ' ■ - ' • 4-- 90" • . 95. 

15 ' " ■ 

Leu Glu Tyr Trp Gly His Asn Asp Leu Tyr Leu Ala Glu Val Asp Pro 

100 ' - 105 • . . llOv 

^ ' ' ■ ■ . • /■ ...^1 

Lys Ser Tyr Leu lie Leu Tyr Met He Asn Gin Tyr Asn Asp Asp Thr 
20 . 115 120 125 

Ser Leu Val Ala His Leu Met Val Arg Asp Leu Ser Arg Gin Gin Asp 
130 135 140 

25 Phe Leu Pro Ala Phe Glii* Ser Val Cys GluvAsp..Ile iGly Leu His Lys 
145 • ^ 150 ' ' f?'- 155 . / . ,160^ 

Asp Gin He Val Val Leu Ser Asp Asp Asp Arg Cys Gin Gly Ser Arg 
165' 170)/: ,175 

30' ' > "-^ ... , ... 1.^ , • 

Asp 



(2) INFORMATION FOR SEQ ID NO: 71: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 774 base pairs 

(B) TYPE: nucleic acid 

'= (C) ' STR7VNDEDNESS r single 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



45 (ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..537 

50 <xi) SEQUENCE DESCRIPTION: SEQID;NO:71: ' , , , 

CAG CTC CTA CTG CTG ACC GTG GGC CTG GCA CTG ATC TGT GGC CTC CAG 46 
Gin Leu Leu Leu Leu Thr Val Gly Leu Ala Leu He Cys Gly Leu Gin 

1 5 10 15' ' 

GCT CAG GAG G6A AAC CAT GAG GAG CCC CAG GGA GGC CTA GAG GAG CTG 96 
Ala Gln^ Glu' Gly Asn Hist Glu> Glu Pro Gin Gly Gly, Leu Glu Qlu Leu 

- ' 20 c/. 25' " ' ^ 3d ^' ' 



55 
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TCT GGG AGG TGG CAC TCC GTT ' GCC. CTG GCC iTCC - AAC: AAG, TCCcGAT ,CT 144 
Ser Qly Arg Trp His Ser Val Ala Leu Ala Ser Asn Lya Ser Asp Leu 

35 ' ' ■ ' ■ • ' 40 ' ■ ■45..' 

5 ACC AAA CCC TGG GGG CAC TTC AGG GTT TTC ATC CAC AGC ATG AGC GCA 192 
Thr Lys Pro Trp Gly His ' Phe ^^Arg- Val'*Phe lie His . Ser/Met Ser Ala 
50 55 60 

AAG GAC GTC AAC CTO CAC GGG GAT ATC CTT* ATA CCG CAG GAC GGC CAG 240 
10 Lys Asp Val Asn Leu His Gly Asp lie Leu lie Pro Gin Asp Gly Gin 
65 70 75 80 

TGC GAG AAA GTC TCC CTC ACT GCG TTC AAG ACT GCC ACC AGC AAC AAA 288 
Cys Glu Lys Val Ser Leu Thr Ala Phe Lys Thr Ala Thr Ser Asn Lys 
15 B5 - 90 ' 95 

TTT GAC CTG GAG TAC TGG GGA CAC AAT GAC CTG TAC CTG GCA GAG GTA 336 
Phe Asp Leu Glu Tyr Trp Gly His Asn Asp Leu Tyr Leu Ala Glu Val 
100 - 105 110 



20 



GAC CCC AAG AGC TAC CTG ATT CTC TAC ATG ATC AAC CAG TAC AAC GAT 384 
Asp Pro Lys Ser Tyr Leu lie Leu Tyr Met lie Asn Gin Tyr Asn Asp 
115 120 125 

25 GAC ACC AGC CTG GTG GCT CAC CTG ATG GTC CGG GAC CTC AGC AGG CAG 432 
Asp Thr Ser Leu Val Ala His Leu Met Val Arg Asp Leu Ser Arg Gin 
130 135 140 

CAG GAC TTC CTG CCG GCA TTC GAA TCT GTA TGT GAA GAC ATC GGT CTG 480 
30 Gin Asp Phe Leu Pro Ala Phe Glu Ser Val Cys Glu Asp lie Gly Leu ^ 
145 150 155 160 

CAC AAG GAC CAG ATT GTG GTT CTG AGC GAT GAC GAT CGC TGC CAG GGT 528 
His Lys Asp Gin lie Val Val Leu Ser Asp Asp Asp Arg Cys Gin Gly 
35 155 170 175 

TCC AGA GAC TAGGGCCTCA GCCCACGCAG AGAGCCAAGC AGCAGGATCT 577 
Ser Arg Asp 

40 

CACCTGCCTG AGGACTCAGA CCTATAGGCT CGGGGGACAC ' CGTACTCAGC TCTGCGTCCC 637 

TCTCTGCGAA CCCTCCAGGT GATCCCAGCA ACAACACCCA CCTGCGCTTC CATGTGCGGC 697 

45 CCTGTCCAGC CTGCGCCCAC TCCCTGCCTG GGCAGCCACA CACTCCCCAG CCCCCTGCTA 757 

TGGTCCCTCC TCGCATA 774 



50 (2) INFORMATION FOR SEQ ID NO: 72: 

{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENiSTH: 179 amino acids 

(B) TYPE: amino acid 
55 <D) TOPOLOGY: linear/ 

(ii) MOLECULE TYPE:* protein 
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(xi) SEQUENCE DESCRIPTION : ' SEQ ID NO:72: 

\ ■ . ■ ■ - J . • : • . . . 

Gin Leu Leu Leu Leu Thr Val Gly Leu Ala Leu He Cys Gly Leu Gin 
1 5 10 15 

5 ■ ' ' = 

Ala Gin Glu Gly Asn His Glu Glu Pro Gin Gly Gly Leu Glu Glu I-eu 
20 25 30 

' Ser Gly Arg Trp His Ser Val Ala Leu Ala Ser Asn Lys Ser Asp Leu 
10 35 ' ' 40 45 . 

Thr Lys Pro Trp Gly His Phe Arg Val Phe He His Ser Met Ser Ala 

50 ' ' 55 . , , . €0 

is Lys Asp Val Asn Leu His Gly Asp He Leu He Pro Gin Asp Gly Gin 
65 70 75 BO 



20 



35 



40 



cys Glu Lys* Val Ser Leu Thr Ala Phe Lys Thr Ala Thr Ser Asn Lys 
, 85 ,90 95 

Phe Asp Leu Glu Tyr Trp Gly His.Asn Asp Leu Tyr Leu Ala Glu Val 
• 100 i . . 105' ! 110 



Asp Pro Lys Ser Tyr Leu He Leu Tyr Met He Asn Gin Tyr Asn Asp 

25 ^ 115 i ' 120 . , , 125 

Asp Thr Ser Leu Val Ala His Leu Met Val Arg Asp Leu Ser Arg Gin 
130 135 140 

30 Gin Asp Phe Leu Pro Ala Phe Glu Ser Val Cys Glu Asp He Gly Leu 
145 . ISO 155 ' 160 



His Lys Asp Gin He Val Val Leu Ser Asp Asp Asp Arg Cys Gin Gly 

165 , \: 170 175 

Ser Arg Asp 



(2) INFORMATION FOR SEQ ID NO:73: 



(i) SEQUENCE CHARACTERISTICS: . . , 

(A) LENGTH: 998 base pairs 

(B) TYPE:^ nucleic acid _ 

(C) STRANDEDNESS : single 

45 : (D) TOPOLOGY: linear; - ;r . : .r . ; 

(ii) MOLECULE TYPE: cDNA 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:73: 
AGAGCTGGAC CCGTGTGTGT GCTQGCCAAT GAGCCCTGGA GcbTCCGGCT CCAGAGTACC 60 
55 CTCTTGGCAC AGGGCCGAGT CCATCGGGAC AGATGAACCT AGAGGACTCC ACTGCCCTCC 120 
CATCCACGGG GCCGGGTCAC CAGACTCTGC AAGTCTCCAG CTGTCGCCAA ACCCAGACAG 180 
AAGGTGCTGT GGACATGCAG CTCCTACWGC ACYKWCYGTC TGGGYYTGGC ACTGATCTGT 240 
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GGCCTCCAGG CTCAGGAGGG AAACCATGAG GAGCCC^GG GAGGCCTaW^GGAGCTOTCT ' 300 

GGGAGGTGGC ACTCCGTTGC CCTGGCCTCC AACAAGTCCG ATCTGAYCAA ACCCTGQGGG 360 

5 

CACTTCAGGG TTTTCATCCA CAGCATGAGC GCAAAGGACG KCAACCTGCA CGGGGATATC 420 

CTTATACCGC AGGACGGCCA GTGCGAGAAA GTCTCCCTCA CTGCGTTCAA GACTGCCACC 480 

10 AGCAACAAAT TTGACCTGGA GTACTGGGGA CACAATGACC TGTACCTGGC AGAGGTAGAC 540 

CCCAAGAGCT ACCTGATTCT CTACATGATC AACCAGTACA ACGATGACAC CAGCCTGGTG 600 

GCTCACYTGA TGGTCCGGGA CCTCAGCAGG CAGCAGGACT TCCTGCCGGC ATTCGAATCT 660 

15 

GTATGTGAAG ACATCGGTCT GCACAAGGAC CAGATTGTGG TTCTGAGCGA TGACGATCGC 720 

TGCCAGGGTT CCAGAGACTA GGGCCTCAGC CYACGCAGAG AGCCAAGCAG CAGGATCTCA 780 

20 CCTGCCTGAG GACTCAGACC TATAGGCTCG GKGGACACCG TACTCAGCTC TGCGTCCCTC 840 

TCTGCGAACC CTCCAGGTGA TCCCAGCAAC AACACCCACC TGCGCTTCCA TGTGCGGCCC 900 

' }■ ' - - . 

TGTCCAGCCT GCGCCCACTC CCTGCCTGGG CAGCCACACA CTCCCCAGCC CCCTGCTATG 960 

GTCCCTCCTC GCATAATAAA GGACATTCCG TTCAAAAA 998 
(2) INFORMATION FOR SEQ ID NO: 74: 



25 



30 (i) SEQXmNCE CHARACTERISTICS: 

(A) IiENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



35 



(ii) MOLECULE TYPE: CDNA 



40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

GGGGGATCCC AGATCGGACT TATTGGAGGC 30 
(2) INFORMATION FOR SEQ ID NO: 75: 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
50 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

55 • ; 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 



, T 



GGGGGATCCG 6AGGCCA6GG CAACGGA 27 
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(2) INFORMATION FOR SEQ ID NO:76: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

10 



(xi) SEC3UENCE DESCRIPTION: SEQ ID NO: 76: 

15 GGGGGATCCA ACGGAGTGCC ACCTCCC 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE; CDNA 



30 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
GGGGGAATTC GAGGAGCTGT CTGGGA6GTG G ' 
(2) INFORMATION FOR SEQ ID NO:7B: 



35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78; 

GGGGGAATTC AGGTGGCACT CCGTTGCCCT G 
(2) INFORMATION FOR SEQ ID NO: 79: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
55 (D) TOPOLOGY: linear 



(ii) MOLECDIiE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

GGGGGAATTC GCCCTGGCCT CCAACAAGTC C 31 

(2) INFORMATION FOR SEQ ID'noVb'o: ' : - ? ' ^ r . 

(i) SEQUENCE CHARACTERISTICS: "'^ ■ ' / - . ^ 

(A) LENGTH; 30 base pairs 
10 (B) TYPE: nucleic acid ' v ^ v. , t 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear > . 



15 



35 



(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQIDNO:80: 

20 GGGGGAATTC GAG6GAAACC ATGAGGAGCC 30 

(2) INFORMATION FOR SEQ ID NO:81: 

<i} SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 21 t>ase pairs 

(B) TYPE: nucleic acid ; ' * * * " ' ' 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear ' ' • - : 
30 (ii) MOLECULE TYPE: CDNA 



(xi) SEQtJENCE DESCRIPTION: SEQ ID NO: 81: 
GGACTTGTTG GAGGCCAGGG C 21 
<2) INFORMATION FOR SEQ ID NO: 82: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single . 

(D) TOPOLOGY: linear 

45 ' ■ 

(ii) MOLECULE TYPE: cDNA ' ^ . ' , . ' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

50 

GGGGGAATTC ATCAAACCCT GGGGGCACTT 30 
(2) INFORMATION FOR SEQ ID NO: 83:- ^ . , 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECOLE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
GGGGGATCCA AGTGCCCCCA GGGTTTGAT 
10 (2) INFORMATION FOR SEQ ID NO:B4: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



20 



25 



35 



55 



(ii) MOliECDLE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
CACGGGGATA TCCTTATACC 
(2) INFORMATION FOR SEQ ID NO:B5: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:B5: 

40 GTACAACGAT GACACCAG 

(2) INFORMATION FOR SEQ ID NO:a6: 

(i) SEQUENCE CHARACTERISTICS:' * 
45 (A) IjENGTH: 16 base pairs ' 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

50 (ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ 'ID NO:86 ; 
TGTCCCCCGA GCCTAT ^ 
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10 



25 



40 



-81- 

uS^'iNFoiRMjCri^ JL'Xa LrlQ a J. a exH ^io u.tQ 

(i) SEQUENCE CHARACTERISTICS: 

(A) USNGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDBDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOI#ECUI*E TYPE: CDNA 



(xi ) SEQUENCE DESCRIPTION : SEQ * ID ' NO : 87 ; 

15 TCCATCTGCT GACCGTGG 

(2) INFORMATION FOR SEQ ID NO: 88 : 

<i) SEQUENCE CHARACTERISTICS:'^ ' 
20 (A) I-ENGTH: 38 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear* 



(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Glu Gly Asn His Glu Glu Pro Gin Gly Gly Leu Glu Glu Leu Ser Gly 
1 5 10 15 

35 Arg Trp His Ser Val Ala Leu Ala Ser Xaa Lys Ser Asp Leu lie Xaa 

20 25 ' 30 



Pro Trp Gly His Phe Arg 
35 

(2) INFORMATION FOR SEQ ID NO: 89: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 amino acids 
45 (B) TYPE: amino acid 

(D) TOPOLOGY: linear^ ^ 

(ii) MOLECULE TYPE:, peptide 

50 (v) FRAGMENT TYPE : internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

55 '/ V * ^ = • ' ■ 

Met Lys Leu lie Leu Leu Leu Leu Cys Leu Gly Leu lie Leu Val Cys 
1 5 . .. 10 15 
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Glx Gly His Ala Glu Glu Ala Asn Ser Glu Arg Gly Asn Leu Asp Val 
20, 25 30 

Asp Lys Leu Asn Gly Asp Trp Phe .Ser lie Val Val Ala Ser Ajsn Lys 
5 35 40 .45 

Arg Glu Lys He Glu Glu Asn Gly Ser Met Arg Val Phe Met Gin His 
50 55 60 

10 He Asp Val Leu Glu Asn Ser Leu Gly Phe Lys Leu Cys He Lys Glu 

65 70 75 80 



15 



30 



35 



40 



45 



Asn Gly Glu Cys Arg Lys Leu Tyr Ser Val Ala Tyr Lys Thr Pro Lys 
85 90 ' 95 

He Gly Glu Tyr Phe Leu Glu Tyr Asp Gly Gly Asn Thr Phe Thr He 
100 . .105 ., . HO 



Leu Lys Thr Asp Tyr Glu Arg Tyr Val Met Phe His Leu Val Asn Val 

20 115 120 r . ;V 

Asn Asn Gly Glu Ala Phe Gin Leu Met Glu Leu Tyr Gly Arg Thr Lys 
130 135 140 

25 Asp Leu Ser Ser Asp He Lys Glu Lys Phe Ala Lys Leu Cys Glu Ala 

145 150 , 155 160 



His Gly He Thr Arg Asp Asn He He Asp Leu Thr Lys Thr Asp Arg 
165 170 175 

. . - '1 

Cys Leu Gin Ala Arg Gly 

. 180 . , , . 'r 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 amino acids 

(B) TYPE: amino acid 
(D) TOPOIiOGY: linear 

<ii) MOLECULE TYIPE: peptide 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION! SEQ ID NO: 90: 



Met Lys Glx Met Leu Leu Leu Leu Cys Leii Gly Leu He Leu Val Cys 
50 1 5 10 15 

Glx Val His Ala Glu Glu Ala Ser Ser Thr Gly Arg Asn Phe Asn Val 
20 25 30 

55 Glu Lys He Asn Gly Glu Trp His Thr He He Lieu Ala Ser Cys Lys 

35 40 45 

Arg Glu Lys He Glu Asp T^n Gly Asn Phe Arg Leu Phe Leu Glu Gin 
50 55 60 
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10 



25 



35 



He His Val Leu Glu Asn Ser Leu Val Leu Lys Phe His Thr Val Arg 

65 70 " ' '7^75 - ^ 80 

Asp Glu Glu Cys Ser Glu Leu Ser Met Val Ala Asp Lys (Thr Glu Lys 

Q5 ' 90'; ■ [ 95 

. . ^ .J . . ' . ' . 

Ala Gly Glu Tyr Ser Val Thr Tyr Asp Gly Phe Asn Thr Phe Thr He 
100 105 . . 110 

Pro Lys Thr Asp Tyr Asp Asn Phe Leu Met Ala His Leu He Asn Glu 

115 ^120 . 125 



Lys Asp Gly Glu Thr Phe Gin Leu Met Gly Leu Tyr Gly Arg Glu Pro 
15 130 135 ■ ' 140 

Asp Leu Met Ser Asp He Lys Glu Arg Phe Ala Gin Leu Cys Glu Glu 
145 ISO 155 160 

20 His Gly He Leu Arg Glu Asn lie He Asp 'Leu Ser Asn Ala Asn Arg 

165 170 175 



Cys Leu Gin Ala Arg Glu 
ISO 

(2) INFORMATION FOR SEQ ID NO: 91: 



<i) SEQUKNCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOliOGY: linear 



(ii) MOLECULE TYPE: CDNA 



(xi) SEQX3ENCE DESCRIPTION: SEQ ID NO: 91: 

40 GGGAATTCAA TCCATCGAAG GAAG 

(2) INFORMATION FOR SEQ ID NO: 92: 

<i) SEQXJENCE CHARACTERISTICS: 
45 (A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear . 

50 (ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

55 

GGGAATTCAA TCGATCGAAG GAAG 
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(2) INFORMATION FOR SEQ ID NO: 93: 

' ' r • r X i * • ^ . : - , , t 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs 
5 (B) TYPE: .nucleic acid - , ^ 

(C) STRANDEDNESS: .single 

(D) TOPOIiOGY: linear 



10 



30 



40 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

15 CGTACGGAAA GATGCTGACC CCAGATOGG 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 <ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
CTCTGAAAGA TGCTGACCCC AGATCGG 
(2) INFORMATION FOR SEQ ID NO: 95: 



35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: C0NA 



45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95; 

GGGTCTAGAG GTACCGTCCG 
(2) INFORMATION FOR SEQ ID NO: 96: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS :^^flingle 
55 (D) TOPOLOGY: 'linear' 



(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID H0:96: 

GGGTCTAGAG GTACCGTCCG ATCGTCGATC ATT ^ ' '* '* ^ 

5 - ^ . 

(2) INFORMATION FOR SEQ ID NO:^?: ' ; ^ , ' . . . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 
10 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide ^ ; • . - • : 

15 (v) FRAGMENT TYPE: internal* / . , t / 



20 



25 



35 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97 : . • . ; , 

Gly Arg Trp His Ser Val Ala Zieu Ala ;Ser. Lys Ser Asp 
1 5 - ' 10 A- \ \ 



(2) INFORMATION FOR SEQ ID NO: 96: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS:^ single 
30 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi ) SEQUENCE DESCRIPTION : SEQ ID NO : 98 : > ; t * . 
ATCTGTGGCC TCCAGGCTCA GGAGGGAAAC CATGAGGAGC CCCAGGG 



40 (2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 



AAACCCTGGG GGCACTTCAG GGTTTTCATC CACAGCATGA GCGCA 

55 ' ' 
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10 



30 



40 



45 



(2) INFORMATION FOR SEQ ID NO: 100: . 

(i) SEQXJBNCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:X00: 
15 GGGCTCGAGG TCGAGTTTTT TTTTTTTTTT TTGAC . . 
(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid . , 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQIDNO:101: 
CCGGAATTCC GG 

(2) INFORMATION FOR SEQ ID NO: 102: 



35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13...base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(iv) ANTISENSE: yes 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
50 AATGATCGAT GCT 

(2) INFORMATION FOR SEQ : ID NO: 103: 

( i ) SEQUENCE , CHARACTERISTICS : . ^ 
55 (A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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10 



20 



45 



(ii) MOLECDLE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:103; 
GGGGGATCCC TAGTCTCTGG AACCCTG 
(2) INFORMATION FOR SEQ ID. NO: 104: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single, 
15 (D) TOPOIiOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:104 
GGGGAATTCG AGGGAAACCA TGAGGAG 
25 (2) INFORMATION FOR SEQ ID NO:105: . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

40 Asp Thr Val Ala Val Ser Gly Lys Trp Tyr Leu Lys Ala Met Thr Ala 
1 5 ^ 10 15 

Asp Gin Glu Val Pro Glu Lys Pro Asp Ser Val Thr Pro Met lie Leu 
20 25 ' 30 



Lys Ala Gin Lys Gly Gly Asn Leu Glu Ala Lys lie Thr Met Leu Thr 

35 40 * . ' 45 



Asn Gly Gin Cys Gin Asn lie Thr Val Val Leu His Lys Thr Ser Glu 
50 50 55 ' 60 . 



Pro 

65 



55 (2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) IJ5NGTH: 53 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE : peptide 
5 (v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 106: 

Thr Val Val Leu His Lys Thr Ser Glu Pro Gly Lys Tyr Thr Ala Tyr 
1 5 10 15 

Glu Gly Gin Arg Val Val Phe He Gin Pro Ser Pro Val Arg Asp His 
20 25 ' 30 

Tyr He Leu Tyr Cys Glu Gly Glu Leu His Gly Arg Gin lie Arg Met 
35 40 45 

Ala Lys Leu Leu Gly 
50 

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:107: 

Tyr Cys Glu Gly Glu Leu His Gly Arg Gin He Arg Met Ala Lys 
5 10 15 

Leu Leu Gly Arg Asp Pro Glu Gin Ser Gin Glu Ala Leu Glu Asp Phe 
20 25 30 

Arg Glu Phe Ser Arg Ala Lys Gly Leu Asn Gin Glu He Leu Glu Leu 
45 35 40 45 

Ala Gin Ser Glu Thr Cys Ser Pro Gly Gly Gin Xaa 
50 55 60 

50 (2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

55 (D) TOPOLOGY: linear ... 

(ii) MOLECULE TYPE: peptide 
(v> FRAGMENT TYPE: internal 



10 



15 



20 



25 



30 



Leu 
1 

40 
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15 



20 



(xi) SEQUENCE DESCRIPTION: SBQIDNOilOB: 

Gly Lys Trp Tyr Ijeu Lys Ala Met* Thr Ala 'Asp Gin, Glu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO':109r - 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : '13 amino acids ^ 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOI-ECULE TYPE: peptide; 

(v) FRAGMENT TYPE: internal ' 



(xi) SEQUENCE DESCRIPTION: ; SEQ. ID NOrlOS: 



Ala Lys He Thr Met Leu Thr Asn Gly Gin Cys Gin Asn 

25 1 5 ^ ^ ^ : . 10 . : 
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What is claimed is: 

1 . An isolated nucleic acid comprising a nucleotide sequence encoding a peptide 
having an activity of a dog dander allergen, CaD-f 1. 

5 

2. An isolated nucleic acid of claim I , which is a cDNA sequence. 

3. An isolated nucleic acid of claim 2» wherein the cDNA comprises a nucleotide 
sequence shown in Figure 5 (SEQ ID NO; 1). 

10 

4. An isolated nucleic acid of claim 2, wherein the cDN A comprises the coding 
region of a nucleotide sequence shown in Figure 5 (SEQ ID NO: 1). 

5. An isolated nucleic acid of claim 1 , wherein the peptide comprises an amino 
1 5 acid sequence shown in Figure 5 (SEQ ID NO: 2). 

6. An isolated nucleic acid of claim 5, wherein the peptide comprises amino acid 
residues 1-148 of the sequence shown in Figure 5 (SEQ ID NO: 2). 

20 7. An isolated nucleic acid of claim 1 . wherein the peptide is at least 50% 

homologous with a sequence comprising an amino acid sequence shown in Figure 5 (SEQ ID 
NO: 2). 

8. An isolated nucleic acid of claim 1 , wherein the peptide is encoded by a 
25 nucleic acid which hybridizes under high or low stringency conditions to a nucleic acid 

which encodes a peptide comprising an amino acid sequence shown in Figure 5 (SEQ ID 
NO: 2). 

9. An isolated nucleic acid of claim 1, wherein the peptide is at least 10 amino 
30 acids in length. 

10. An isolated nucleic acid of claun 1 , wherein the peptide is at least 20 amino 
acids in length. 

35 11. An isolated nucleic acid of claim 1 , wherein the peptide is at least 25 amino 

acids in length. 

12, An isolated nucleic acid of claim 1 , wherein the peptide is at least 30 amino 
acids in length. 
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13. An isolated DN A comprising a nucleotide sequence encoding a peptide of at 
least 20 amino acid residues or more in length and having at least about 50% homology with 
an amino acid sequence comprising a sequence shown in Figure 5 (SEQ ID NO: 2). 

5 

14. An isolated nucleic acid comprising a nucleotide sequence encoding a peptide 
having an activity of a dog dander allergen, CaafH- 

15. An; isolated nucleic acid of claim 14, which is a cDNA sequence, 

1 6. An isolated nucleic acid of claim 1 5, wherein the cDN A comprises a 
nucleotide sequence shown in Figure 1 8 (SEQ ID NO: 67). 

17. An isolated nucleic acid of claim 15, wherein the cDN A comprises the coding 
15 region of a nucleotide sequence shown in Figure 18 (SEQ ID NO: 67). 

18. An isolated nucleic acid of claim 1 4, wherein the peptide comprises an amino 
acid sequence shown in Figure 1 8 (SEQ ID NO: 68). 

20 1 9. An isolated nucleic acid of claim 1 8, wherein the peptide comprises amino 

acid residues 1-161 of the sequence shown in Figure 18 (SEQ ID NO: 68). 

20. An isolated nucleic acid of claim 14, wherein the peptide is at least 50% 
homologous with a sequence comprising an amino acid sequence shown in Figure 18 (SEQ 

25 "''id NO: 68)/ ''^^'*'^'^'^ ^ • " ' i:"* ■ ' : ^ ' " .^h ■ - 

21 . An isolated nucleic acid of claim 1 4, wherein the peptide is encoded by a 
nucleic acid which hybridizes under high or low stringency conditions to a nucleic acid 
which encodes a peptide comprising an amino acid sequence shown in Figure 1 8 (SEQ ID 

30 NO: 68). 

22. An isolated nucleic acid of claim 14, wherein the peptide is at least about 10- 
20 amino acids in length. 

35 ' 23. An isolated nucleic acid of clahn 14, wherein the peptide is at least about 1 0- 
16 amino acids in length. ' ' 



24. An isolated DN A comprising a nucleotide seqxience encoding a peptide of at 
least about 10-16 amino acid residues or more in length and having at least about 50% 
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homology with an amino acid sequence comprising a sequence shown in Figure 18 (SEQ ID 
NO: 68). J A 

25. A recombinant expression vector comprising the nucleic acid of claim 1 . 

' - 26. ' A recombinant expression vector of claim 25, wherein the nucleic acid is 
cDNA. 

27. A recombinant expression vector of claim 26, wherein the cDNA comprises 
1 0 the nucleotide sequence shown in Figure 5 (SEQ ID NO: 1). 

28. A recombinant expression vector comprising the nucleic acid of claim 14. 

29. A recombinant expression vector of claim 28, wherein the nucleic acid is 

15 cDNA. 

30. A recombinant expression vector of claim 29, wherein the cDNA comprises 
the nucleotide sequence shown in Figure 1 8 (SEQ ID NO: 67). 

20 31. A host cell transfected with the recombinant expression vector of claim 29 

capable of directing the expression of a peptide having an activity of Caaf I. 

32. A host cell of claim 3 1 which is a eukaryotic cell. 

25 33. A host cell transfected with the recombinant expression vector of claim 27 

capable of directing the expression of a peptide having an activity of Caof I. 

34. A host cell of claim 33 which is a eukaryotic cell. 

30 35. A host cell transfected with the recombinant expression vector of claim 28 

capable of directing the expression of a peptide having an activity of Caaf II. 

36. A host cell of claim 35 which is a eukaryotic cell. 

35 37. A host cell transfected with the recombinant expression vector of claim 30 

capable of directing the expression of a peptide having an activity of Cmil H. 

38.- ^ A host cell of claim 37cvvhich is a eukaryotic M^^^ 
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\ ; 39. ' A method of producing a peptide having 
culturing a host cell of claim 3 1 in medium to express the peptide and isolating the peptide 
from the culture. 

5 40. A method of producing a peptide having an activity of £ailJf II, comprising ^ 

culturing a host cell of claim 35 in medium to express the peptide and isolating the peptide 
from the culture. 

41. An isolated peptide having an activity of a dog dander edlergen, Csskll, 
1 0 produced by recombinant expression of a nucleic acid of claim 1 . 

42. An isolated peptide having an activity of a dog dander allergen, QmLfh 
produced by recombinant expression ofa nucleic acid of claim s 

15 43. An isolated peptide having an activity of a dog dander allergen, Can f L 

produced by recombinant expression of a nucleic acid of claim 4. 

44. An isolated peptide having an activity of a dog dander allergen, Canf I, 
produced by recombinant expression of a DNA of claimlSi V ,1 / , , 

20- '''''^''^ ''^''^'''''^''i^'-^'o^ : 

45. An isolated peptide having an activity of a dog dander allergen. Can f I, 
produced by chemical synthesis. 

46. An isolated peptide ofclaim 45 which is at Ieast^aboutl0r20 amino acids in 

25 length. 

47. An isolated peptide of claim 46 which is at least about 10-16 amino acids in 
length. ' * ' ' ' * ■' " ^ • . > ■ ; 

V „•.■ . r. . ^ , , . 

30 48. An isolated peptide having an activity of a dog dander allergen, £aji-f 11,^ . 

produced by recombinant expression of a nucleic acid of claim 14. 

49. An isolated peptide having an activity of a dog dander allergen, CsnSlh 
produced by recombinant expression of a nucleic acid of claim 1 6. 

35 

50. An isolated peptide having an activity of a dog dander allergen, Can f II, 
produced by recombinant expression of a nucleic acid of claim 17. 
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51. An isolated peptide having an activity of a dog dander allergen, Qmf II, 
produced by recombinant expression of a DNA of claim 24. 



52. An isolated peptide having an activity of a dog dander allergen, Caul II, 
5 ' produced by chemical synthesis. 



53. An isolated peptide of claim 52 which is at least about 1 0-20 amino acids m 

length. 

10 54. An isolated peptide of claim 52 which is at least about 10-16 amino acids in 

length. 

55. A modified peptide having an activity of CaiLf I. ' 

j5 56. A modified peptide of claim 55, wherein at least one cysteine residue present 

in the CaiLf I amino acid sequence shown in Figure 5 (SEQ ID.NO: 2) is replaced by another 
amino acid residue. 

57. A modified peptide of claim 56, wherem at least one cysteine residue present 
20 in the Can-f I ammo acid sequence shown in Figure 5 (SEQ ID NO: 2) is replaced by a serine 

residue. 

58. A modified peptide of claim 55, wherein at least one lysine residue is added to 
either the amino or carboxy tenninus or both the amino and carboxy terminus of the peptide. 

25 

59. A modified peptide having an activity of Caaf H. 

60. A modified peptide of claim 59, wherein at least one cysteine residue present 
in the Can f II amino acid sequence shown in Figure 18 (SEQ ID NO: 68) is replaced by 

30 another amino acid residue. 

61 . A modified peptide of claim 60, wherein at least one cysteine residue present 
in the CaiLf II amino acid sequence shown in Figure 18 (SEQ ID NO: 68) is replaced by a 
serine residue. 



35 



62. A modified peptide of claim 59, wherein at least one lysine residue is added to 
either the amino or carboxy terminus or both the amino and carboxy terminus of the peptide. 
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'63. ^ A substantially pure preparation of a pejptideha^ 
dander allergen, I- u - . . - 

64. A substantially pure preparation of a peptide having an activity of a dog 
5 dander allei^eniCaiLfllV' ^ ^ ^ - ^ ' - - -.r rr .- 

65. A composition suitable for phannaceutical administration comprising at least 
one peptide having an activity of Can f I and a pharmaceutically acceptable carrier. 

10 66. A composition of claim 65» wherein the peptide comprises an amino acid 

sequence of Figure 5 (SEQ ID NO: 2). 

67. A composition of claim 66, wherein the peptide comprises amino acid 
residues 1-148 of Figure 5 (SEQ ID NO: 2). 

15 

68. A composition suitable for pharmaceutical administration comprising at least 
on peptide having an activity of Can f II and a pharmaceutically acceptable carrier. 

69. A composition of claim 68, wherein the peptide comprises an amino acid 
20 sequence of Figure 18 (SEQ ID NO: 68). 

70. A composition of claim 69,' wherein the peptide comprises amino acid 
residues 1-161 of Figure 1 8 (SEQ ID NO: 68). 

25 7 1. A method of treating sensitivity to a dog dander allergen in a subject sensitive 

to the allergen, comprising administering to the subject the composition of claim 65. 

72. A method of treating sensitivity to a dog dander allergen in a subject sensitive 
to the allergen, comprising administering to the subject the composition of claim 66. 



30 



35 



73. A method of detecting sensitivity in a subject to a dog dander allergen, 
comprising combining a blood sample obtained from the subject with a peptide of claim 41 , 
under conditions appropriate for binding of blood components with the peptide and 
det^mining the extent to which such binding occurs. 

74. A method of claim 73, wherein the extent to which binding occurs is 
detennined by assessing T cell function, T cell proliferation, B cell function, binding of the 
protein to antibodies present in the blood or a combination thereof. 
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75. A method of treating sensitivity to a dog dander allergen in a subject sensitive 
to the allergen, comprising administering to the subject the composition of claim 68. 

76. A method of treating sensitivity to a dog dander allergen in a subject sensitive 
5 to the allergen, comprising administering to the subject the composition of claim 69. 

77. A method of detecting sensitivity in a subject to a dog dander allergen, 
comprising combining a blood sample obtained from the subject with a peptide of claim 48, 
imder conditions appropriate for binding of blood components with the peptide and 

1 0 determining the cjctent to which such binding occurs. 

78. A method of claim 77, wherein the extent to which binding occurs is 
determined by assessing T cell function, T cell proliferation, B cell function, binding of the 
protein to antibodies present in the blood or a combination thereof 

15 

79. An antibody specifically reactive with a peptide of claim 4 1 . 

80. An antibody of claim 83 which is a monoclonal antibody. 
20 8 1 . An antibody specifically reactive with a peptide of claim 44. 

82. An antibody of claim 8 1 which is a monoclonal antibody, 

83. An antibody specifically reactive with a peptide of claim 48. 

84. . An antibody of claim 83 which is. a monoclonal antibody. 

85. , Anantibody specifically reactive with a peptide of claim 51. 
30 86. An antibody of claim 85 which is a monoclonal antibody. 

87. A T cell clone specifically reactive with a peptide of claim 41 . 

88. A soluble T cell receptor specifically reactive with a peptide of claim 41 . 

89. An antibody specifically reactive with the T cell receptor of claim 88. 

90. The antibody of claim 89 wherein the antibody is a monoclonal antibody. 



35 
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91 . AT cell clone specifically reactive with a peptide of claim 48. 

92. A soluble T cell receptor specifically reactive with a peptide of claim 48. . 
5 93. An antibody specifically reactive with the T cell receptor of claim 92. 

94. The antibody of claim 93 wherein the antibody is a monoclonal antibody. 

95. An isolated peptide having an activity of a dog dander allergen, Cani^I, which 
10 comprises an amino acid sequence selected from the group consisting of SEQ ID NO: 1 05, 

SEQ ID NO: 106, SEQ ID NO: 107, SEQ ID NO: 108, and SEQ ID NO: 109. 
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W Yc I-a ICa M T ^ . 
S lA 5 • -gggaatt ctggta^tt^aa gcfatgac-3 
rcoR X ^ 9 9 t 

c 



W Yc Xif ^ Aa M T ^, 
IB 5 • -gggaatt ctggta'Tct f aa^gcf atgac-3 



D Q E vf pas ^5 
5'-ga^ca^ga^gtfcctga^aa^cc-3* Dog Probe 1 
t g g t g g 

c ^ 



5« -tacattctt"tattgtga3gg-3' Dog Probe 2 
c c ^ a 



M Xa I* K Q KG 

|tt Jcgjgt Jtt^ctagggg-5 'AS 2A 



3 'tactataa 
9 



M la K_ Af Q_ K_G 

A 21 A A 



•tact:atga^tt^cg^.gt^ttfcctagggg-5 'AS 2B 
g ^ j3amH X 



y Xa Y C £ G 
3 'at -tataa^at^acactf cctagggg-5 • AS 3 A 

^ g ^ 9 ^BAXnB X 

Y Xa X*^ y C E G 
3 • at^tat gaf at^acact^cctagggg-5 • AS 3B 
y g t g ^BaaS X 
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G Q R V V F 1 
XS D S • -gggctcgaggccagcgtgtcgtgttcatc-3 • 
Xho X 



Q P S PVR D 
XSE 5 '-gggctcgagcagccgtccccggtgagggac-a ' 
Xho X 



5 • -ca^ga^gc^ctaga^ga^tt-a • Dog Probe 4 

c ^. 



*»a^ 

3 > -acgtcgccggcgtcgaagga? 

3 • -tttttttttttttttttcatgacgtcgccggcgtcgaaggatatcactcagcataagc-S • j-^ 3 

3 • -ggatatcactcagcataagc-5 • JM3 - 1 

3 •-acgtcgccggcgtcgaagga 

. ^a 
BaxoH X 

c-5' jM3-3Bam 



Y la 1. Y C E G 

3 'at^tataaPat^acactf cctagggg-5 • AS 3A 
^ g ^ 9 ^BamH X 



Y la Ii9 r C E G 
3 'at^tatgaf at^acact^cctagggg-5 • AS SB 
^ g ^ ^ ^BamH X 
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Anchor 5 • -gggtctagaggtaccgtccg-3 ' 

AP2 5' -gggtctagaggt:accgtccgatcgatcait-3' . 

jcba J xpn X tagctagtaa- -S'.., 



S--aa^c?gta^c&;gc|a?ga?ag?agac-3-Dog Probe 0 

9 C ^ " ^ 



g-5'ASA g-5'ASB 

3 • -Qtcccccacggactcttcgga^ v: - -3 * -ccgagtcactgagggtac 



E , V . P. , E K , P 



X Xa !• ^ C E G 

3 • at^tat aa^at^acact^cctagggg-5 • AS 3 A 

^ g ^ ^ SaittH X 



T la I-^ C E G 

3 • at^tatgaf at^acact^cctagggg-S ' AS 3B 

r 9 g ^ ^ ^samH X 
^ c 
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10 20 30 40 50 60 

ATGAAGACCCTGCTCCTCACCATCGGCTTCAGCCTCATTGCGATCCTGCAGGCCCAGGAT 

-26 "20 -10 

M K T L L L T I G F S L l A I L Q A Q D 



70 80 90 100 110 120 

ACCCCAGCCTTGG6AAAGGACACTGTGGCTGTGTCAGGGAAATGGTATCTGAAGGCCATG 

1 10 
T P A L G K D T y A V S G K W Y I, K A M 



130 140 ISO 160 170 180 

ACAGCAGACCAGGAGGTGCCTGAGAAGCCTGACTCAGTGACTCCCATGATCCTCAAAGCC 

20 30 
TADQE VPEK P D SVTPM I LKA 



190 200 210 220 230 240 

CAGAAGGGGGGCAACCTGGAAGCCAAGATCACCATGCTGACAAATGGTCAGTGCCAGAAC 

40 ■ 50 
Q K G G N L E A K I T M L T N G Q C Q N 



250 260 270 280 290 300 

ATCACGGTGGTCCTGCACAAAACCTCTGAGCCTGGCTIAATACACGGCATACGAGGGCCAG 

60 ; j . 70 

I T V V I, H K T S P G K Y T A Y ' E G Q 



310 320 330 340 350 360 

CGTGTCGTGTTCATCCAGCCGTCCCCGGTGAGGGACCACTACATTCTCTACTGCGAGGGC 

80 r - 90 
R V V F I Q P S P V R D H Y X L Y C E G 



370 380 390 400 410 420 

GAGCTCCATGGGAGGCaiGATCCGAATGiCX:CAAGCTTCTGGGAAGGGATCCT 

100 110 
E L H. G k[q IRMAKLLGRDP E Q S 
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^•,0 440 450 460 470 '480 

CAAGAGGCCTTGGAG<yVTTTTCGGGAATTCTCAAGAGCCAAAGGATTGAACCAGGA^^ 

120 "° 

B F S R A K G L N Q E I 



Q E A L E D F R 

490 500 510 S20 

TTGGAACTCGCGCAGAGCGAAACCTGCTCTCCAGGAGGACAGTAG 

140; ^ ^ 

L EL AQSETCSPGGQ- 
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% nULTlPLE SEQUENCE ALIGNMENT. * 
Setting of computation parameteri: 



K-tuple value s 1 
Gap penalty s 30 
Window size s 5 
Filtering levels 2.5 
Open, gap cost s 25 > 
Unit gap cost s 25 

Setting o"f other parameters 



The alignment was done on 3 Protein sequences. 

Character to show that a**position in |the alignment is perfectly conserved: 
Character to show that, apposition is pell conserved! 'v'^ ' . t 

Alignment 



CAMFl 

2CANF1 

3CANF1 



DTVAVSGKWYLKAHTADQEVPEKPDSVTPMILKAQKGGNLEAKITMLTNG 50 
DTVAVSGKWYLKAWTADQEVPEKPDSVTPrilLKAQKGGNLEAKITWLTHG 50 
PEKPDSVTPniLKAQKQGNLEAklTIILTNG 30 



CANFl 

2CANF1 

3CANFI 



aCQNITVVLHKTSEPGKYTAYEGQRWFIQPSPVRDRYILYCEGDLUPQA lOO 
QCQNITVVLHKTSEPGKYTAYEGQRWFICIPSPVRDHYILYCEGELHGR- 99 
QCQNITVVLHKTSEPGICYTAYEG0RWFIGPSPVRDHYILYCEGELHGRQ SO 



CWFl 
-2CANF1 
3CANF1 



HLLHPSCHHHSLLQAHHRLLUPHKKLLQGDPCVAQUFSACLGLRA 1^5 
QIRWAKLLGRDPEQAHHRULLPHKKLLQQDPCVAQWFSACUGLRA 144 
IRMAKGLNQEILELAQS ETCSPGGQ 105 



Consensus length: 145 
Identity s 76 ( 52.4?:> 
Similarity: 7 < 4.6:;£) 
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POSITIVITY INDES 



rCAN Fl 



Aoogs 




mean=35.7 
%=89 



A0096 meani^*^|| |f|f I ) \ 



CONST #1 



CONST #2 




mean=14.7 
%=64 



fnean=6.4 
%c73 
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Can/ 2 cDNA CLONING STRATEGY 



poly(A) RNA isolated from dog parotid gland ; 
Double stranded cDNA 

) s; 

PGR using degenerate primers based on Canf 2 sequence 
Agarose gel purify PGR product 



Direct sequencing of the PGR fragment 
Partial nucleotide sequence of Canf 2 
Design Canf 2 specific primers 

I 

PGR using specific primers/cDNA 

' - I--— ■ 

Further sequence of Canf 2 cDNA : 



Generate Canf 2 specific probe 



Screen dog cDNA library 



Full length Canf 2 cDNA 
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Dog ds cDNA with primer/adaptors 




II I II II nil ^T^L 



AP2 



02-1 
02-2 
02-3 



1. Sequential rounds of PCR 
■< 2. ..Sequencing of PGR product 



ICGLQAQBOMMEBP © 

ATCTGTGGCCTCCAGGCTCAGGAGGGAAACCATGAGGAGCCCCAGGQ 5* portlOn Of CDNA 



B 



Dog ss cDNA 



02-4, 
02-5. 
02-6 



I i 1 1 1 1 n I 1 5' 



APA 



1 . Sequential rounds of PCR 

2, Sequencing of PCR product 



3* portion of Dog 2 cDNA 



KPW©MP;KVFIHSMSA 
AAACCCTGGGGGCACTTCAGGQTTTTCATCCACAGCATGAGCGCA 



nlii 
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AP2 5*GGGTCTAGAGGTACCGTCCG3' . 
AT/AL: 5" GGG TOT AGA GGT ACC GTC CGA TCG ATC ATT 3' 

TAGCTAGTAApS' 
G 
G 
3'-T 

APA 

5 GGG CTC GAG GTC GAG ITT TIT TTT TTT TTT TT (GAC) 

D2-1: S'-fiGSGfiAICCCAG ATCGGACTTATTGGAGGC-3' 

D2-2: 5'- nfy^HGATCC GGA GGC CAG GGC AAC GGA-2' 

D2-3: 5'- acanGATrC AAC GGA GTG CCA CCT CCC-3' 

D2-4: S'^SSfiSSAAHCGAGGAGCTGTCTGGG AGGTGG-3' 

> 

D2-5: 5'-fifiQS2AAII.CAGGTGGCACTCCGTTGCCCTG-3' 

D2-6: 5'- oartaGAATTC GCC CTG GCC TCC AAC AAC TCC -3' 

D2-9: 5'- aaaGaAATTC GAG GGA AAC CAT GAG GAG CC -3' 

D2-11: 5'- GGA CTT GTT GGA GGC CAG GGC -3' 

D2-12: 5'- Gr^GGGAATTC ATC AAA CCC TGG GGG CAC TT -3' 

D2-13: 5'- GGGGGATCC AA GTG CCC CCA GGG TTT GAT -3' 

D2-17: 5 CAC GGG GAT ATC CTT ATA CC -3" 

D2-19: 5'- G TAC AAC GAT GAC ACC AG -3* 

D2-20: 5 TGT CCC CCG AGC CTA T -3 ' 

D2-23: 5'- TC CTA CTG CTG ACC GTG G -3' 
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AGAGCTGGACCCX3T 15 

> 75. 



rrrAOTCCATCGGGACavGATGAACCTAGAGGACTCavCTGCC^^ 135 
GotScCAGACTCTGCAAGTCTCCAGCTGTCGCCAAACCCAGACAGAAGGTGCTGTGGAC 195 

ATrCAGCTCCTACTGCTGACCGTGGGCCTGGCACTGATCTGTGGCCTCCAGGCTCAGGAG 255 
M QLLLLTVGLALICGLQAQE 

_j_g ' ■ ■ -10 CiV.'.AJW-- 1 _ . 

GGAAACCATGAGGAGCCCCAGGGAGGCCTAGAGGAGCTGTCTGGGAGGTGGCACTCCGTT 315 
G N H E E P Q G G L E E L,S G R W H S V 

10 20 

GCCCTGGCCrca^CAAGTCCGATCTGATCAAACCCTGGGGGCACTTCAGGGTTTTCATC 375 
ALASNKS DLIKPWGHFRVFI 

30 40 

CACAGCATGAGCGCAAAGGACGGCAACCTGCACGGGGATATCCTTATACCGCAGGACGGC 435 
HSMSAKDGNLH GDILIPQDG 

50 60 

CAGTGCGAGAAAGTCTCCCTCACTGCGTTCAAGACTGCCACCAGCAACAAATTTGACCTG 495 
OCEKVSLTAF KTATSNKFDL 

70 80 

GAGTACTGGGGACACAATGACCTGTACCTGGCAGAGGTAGACCCCAAGAGCTACCTGATT 555 
E YWGHNDLYLAEVDPKSYLI 

90 100 

CTCTACATGATCAACCAGTACAACGATGACACCAGCCTGGTGGCTCACTTGATGGTCCGG 615 
LYMINQYNDDTSLVAHLMVR 

110 120 

GACCTCAGCAGGCAGCAGGACTTCCTGCCGGCATTCGAATCTGTATGTGAAGACATCGGT 675 
DLSRQQDFLPA FESVCED IG 

130 140 

CTGCACAAGGACCAGATTGTGGTTCTGAGCGATGACGATCGCTGCCAGGGTrCCAGAGAC 735 
LHKDQIVVLSDDDRCQGSRD 

150 160 



TAGGGCCTCAGCCAC6CAGAGAGCCAAGCAGCAGGATCTCACCTGCCTGAGTACGGT 
* 



792 
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CANr2 NATIVE 



MQLLIXTVGL ALICGLQAC C GNHEBPQGGL ££LSGRWHSV AXJIS ^ K5DLI 
E GNHEEPQG6L EELSGRWHSV AUVS'^KSDLI 



50 
31 



CANF2 NATIVE 



I(5wGHF5jri HSMSAKDGNL HGDILIPQDG QCEKVSLTAF KTATSNKFDL 



XPWGHPR 



100 
38 



CANF2 NATIVE 



EYWGHNDLYL AEVDPKSYI.I LYMINQYNDD TSLVAHLMVR DLSRQQDFLP 



150 
36 



CANF2 NATIVE 



AFESVCEDI6 LHKDOIVVLS DDDRCQGSRD 



180 
38 
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Untitled-2 Formatted Alignment 



Canf 2 
RAT A2U 
MUP6 MOUSE 



M3— 



LLLirV 



It 



1 1 u c -i 



j^KHEEPQG 6LCBLd3^ SVAiIa[s|4k SD 4S 
:E;INSERGNL DVDKLJ;G:wr SIV\JAS>K^ 49 
EE^SSTGRNF KV£KXf2:W-{ TIIIASIKW: 4B 



C&nf 2 
tUVT A2U 
MUP6 MOUSE 



F X SDV G "TE R b 



ihHSMSAKDG M 
F 4QHIDVLEN 
F LEQIHVLEN 



LfiGDILIPQ 
S I 3FKLCIKE 
£ I 7LKFHTVR 



DGCC3KVSLT 
NGE C UaYSV I 
DEE C 3ELSMV 



ATSNKF 


98 


?KIGEy 


99 


SKAGEY 


9B 



Canf 2 
RAT A2U 
HUP 6 HOUSE 




LII.YMlSrf}YN DDTSLVAlllld ^^LSRQ^it* 148 
VMFHL\N/NN GEAFQLME I f CPpKOLSSDI 149 
LMAHL3K3KD GETFQU4CI f qHlSPDLMjE^ ^48 



Canf 2 
RAT A2U 
MUP6 MOUSE 



LP^ F 3S\| 
KEVF^ 
KEF F F^QI 




3|G(LHKDC I W 
2 ITRD^ I ID 
G ILREi; I ID 



I. SDDC RC DGS 
I FKTi: RC LQA 
LSNA^RCLQA 



re: 



180 
181 
180 



't - 

WO94/K068 24/31 PCTAJS93/12468 



2 S 
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IIIMIH 




09V V 
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Formatted 
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Alignment 
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la cons 
Ic cons 
Ij cons 

Consensus 



AGAGCTGGAC CCGTGTGTGT 6CTGGCCAAT GAGCCCTGGA 6GGTCCGGCT 



50 



50 



la cons 
Ic cons 
Ij cons 

Consensus 

la cons 
le cons 
Ij cons 

Consensus 



ICQ 



CCAGAGTACC CTCTTG6CAC AGGGCC6AGT CCATCGGGAC AGATGAACCT 100 



150 



AGAGGACTCC. ACTGCCCTCC CATCCACGGG GCCGGGTCAC CAGACTCTGC 150 



la cons 
le cons 
Ij cons 

Consensus 



la eons 
Ic cons 
Ij cons 

Consensus 

la cons 
Ic cons 
Ij cons 

Consensus 

la cons 
Ic cons 
Ij cons 

Consensus 

la cons 
Ic cons 
Ij cons 

Consensus 

la cons 
Ic cons 
Ij cons 

Consensus 



AAGTCTCCAG CTGTCGCCAA ACCCAGACAG AAGGTGCTGT GGACATGCAG 



..CTl 
— TG^ 



cTccjAcjisb ACYKvy^arc 



yGGcM*^^^^ ACTGATC rrfT nCCCTCCAGd 250 



^>rriir,nAnnG j^ftiirrkTGAG a ^nrrrri.r.c. GAnnrrTRHR GGfiGCTGTgTl 



r.r,q fir,nTf.r,g A r Trrr:>r>Pf:n rCT^^r-r>rrr x^CAAGTCCG fiTCTGAM CAA| 



nrrrTRBRGCS Cfir^TTrAGGG T T-rTr^TrrA ci^GrftTGftGr GcpAiyfiQgACG^ 




200 
3 

200 



45 
246 
49 



95 
296 
99 

300 



145 
346 
149 

350 



195 
396 
199 

400 



245 
446 
249 

450 
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la cons 
ic eons 
Ij cons 

Consendus 



^^ ^jrrrrCA n^^^^TCAA GACTGCCACC AGCAACAyj>,T TTGftCCTgG?J 



295 
496 
299 

500 



la cons 
Ic cons 
Ij cons 

Consensus 



CTHrTfiGGGA r^r;^TGACC TCTACCTGGC AGAGGTAGAC CCCAAGAGCTi 550 



345 
546 
349 



la -cons 
le cons 
Ij cons 

Consensus 



ArCTGATTCT rTArATGATC AACCAGTACA ACGATGACAC CAGCCTggTG) 



395 
596 
399 

600 



la cons 
Ic cons 
Ij cons 

Consensus 



GCTCAcM r^ft TGGTCCGGGA CCTCAGCAGG CAGCAGgACT TQCTgCCGgcj 650 



445 
646 
449 



la cons 
Ic cons 
Ij cons 

Consensus 



ATTCGAATCT GTATGTGAAG ACATCGGTCT GC ACAAGGAC CAGATTGTGC 



495 
696 
499 

700 



la cons 
Ic cons 
Ij cons 

Consensus 



TTCTGAGCGA TGACGATCGC TGCCAGGGTT CCA GAGACTA GGGCCTCAGC 



545 
746 
549 

750 



la cons 
Ic cons 
Ij cons 

Consensus 



j ^CGCAGAG AGCCAAGCAG CAGGATCTCA C CTGCCTGAG bACTCAGACC 



595 
785 
599 

800 



la cons 
Ic cons 
Ij cons 

Consensus 



TaTAbGC jcG Gt CGGACACCG TACTCAGCTC TGCGTCCCTC TCTGCGAACC 



645 
791 
649 

850 



la cons 
Ic cons 
Ij cons 

Consensus 



695 
791 
699 



CTCCAGGTGA TCCCAGCAAC AACACCCACC TGCGCTTCCA TGTGCGGCCC 900 
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a/c/j Formatted Alignment 

7<5 

la cons 1111111111 1111111111 1 791 

Ic cons 749 

It cons '** ' 

consensus TGTCCAGCCT GCGCCCACTC CCTGCCTGGG CAGCCACACA CTCCCCAGCC 950 

793 

la cons 1111 1111111111 1111111- 791 

Ic cons — ' ; — 774 

ij cons . — • , , 

consensus CCCTGCTATG CTCCCTCCTC GCATAATAAA GGACATTCCG TTCAAAAA 998 
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MEAN STIMULATION INDICES 




CONST #2 
CONST #3 




